ABSTRACT

We construct a set of model spectra specifically designed to match the colours of the 
BOSS CMASS galaxies and to be used with photometric redshift template fitting 
techniques. As a basis we use a set of spectral energy distributions (SEDs) of single 
and composite stellar population models. These models cannot describe well the whole
colour range populated by the CMASS galaxies at all redshifts, so we modify
them by multiplying the SEDs with $\lambda^{-\beta}$ for $\lambda > \lambda_i$, for different values of $\lambda_i$ and $\beta$.
When fitting these SEDs to the colours of the CMASS sample, with burst and dust
components in superposition, we can recreate the location in colour spaces inhabited
by the CMASS galaxies. From the best fitting models we select a small subset in a
two-dimensional plane, onto which the galaxies were mapped by a self-organizing map.
These models are used for the estimation of photometric redshifts with a Bayesian 
template fitting code. The photometric redshifts with the novel templates have a very
small outlier rate of 0.22%, a low bias $\langle \Delta z/(1 + z) \rangle = 2.0 \cdot 10^{-[\ldots]}$, and a scatter of
0.026 in the restframe. Using our models, the galaxy colours are reproduced
to a better extent with the photometric redshifts of this work than with photometric
redshifts of SDSS.

Key words: galaxies: distances and redshifts, galaxies: evolution, galaxies: fundamental
parameters (colours)


1 INTRODUCTION 


Spectroscopic surveys provide very precise measurements of the cosmological
redshift, but they are time consuming and cannot be applied to fainter galaxies.
Significantly larger volumes of the cosmos can be probed on shorter timescales
with photometric surveys, which observe through selected filter bands. Since
spectral features cannot be resolved by medium- or broadband photometry, one has
to apply statistical methods to derive the photometric redshift (photo-z).
Redshift measurements are necessary in many cosmological contexts, whenever
information about the redshift tomography or the distribution of galaxies as a
function of redshift is required. The results of cosmological applications
strongly depend on the photometric redshift accuracies. Large biases and scatter
in the photo-zs can degrade any such study, e.g., the dark energy constraints
from shear tomography (Ma, Hu & Huterer 2006), or the baryonic acoustic
oscillation scale (Benítez et al. 2009; Sanchez et al. 2011). However,
statistical errors can be accounted for if they are well known (e.g., Huterer
et al. 2006). Furthermore, in contrast to using a single photo-z estimate,
including the probability density function (PDF) P(z) in the analysis of the
photo-z uncertainties enhances the accuracy of cosmological measurements (e.g.,
Mandelbaum et al. 2008; Hildebrandt et al. 2012).


The techniques for photo-z estimation are commonly divided into two categories:
template fitting and empirical methods. Empirical methods learn a relation
between the photometric observables and the spectroscopic redshift (spec-z) of a
training set of galaxies in order to apply that relation to objects without
spectroscopic information (e.g., Collister & Lahav 2004; Gerdes 2009; Carrasco
Kind & Brunner 2013, 2014). These techniques have the great advantage that they
take calibration errors explicitly into account, and that they can include
photometric observables other than magnitudes and colours. Nevertheless, they
strongly rely on the training sample, which has to be a good representation of
the query galaxies to yield accurate photo-zs. Furthermore, empirical methods
have to be trained anew when they are applied to different surveys, filter
systems, or extracted magnitudes or fluxes. Also, many empirical methods do not
take photometric measurement uncertainties into account.

In contrast to empirical methods, template fitting techniques essentially
perform a maximum likelihood fit to the data (e.g., Arnouts et al. 1999; Benítez
2000; Bolzonella, Miralles & Pelló 2000; Ilbert et al. 2006; Feldmann et al.
2006; Brammer, van Dokkum & Coppi 2008). A set of template spectral energy
distributions (SEDs) is shifted to several redshift steps, where the SEDs are
multiplied with the filter functions and integrated over. The fluxes of the
templates predicted in this way are then fitted to the photometry of the data.
Therefore, template fitting techniques can be applied to data from any
photometric system and, concurrently with the photo-z estimation, also provide
restframe properties. In order to predict the photo-zs to a high precision, the
data has to be well calibrated (or a catalogue with precise redshift
measurements has to be available to re-calibrate the zeropoints). Furthermore,
it is of paramount importance that the underlying model SEDs represent the data
in question. In order to achieve that, some codes use combinations of model
SEDs, or “repair” the templates (e.g., Csabai et al. 2003; Feldmann et al. 2006;
Brammer, van Dokkum & Coppi 2008). However, the underlying SEDs already have to
be well selected if a modification of them is to succeed as a representation of
the data. Moreover, if one introduces templates that do not match the data, the
quality of the photometric redshifts deteriorates. Therefore, we generate in
this work a model set designed to match the data in question.
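The prediction of template colours described above (redshift the SED, multiply with the filter function, integrate) can be illustrated with a minimal sketch. It is not the implementation of any particular photo-z code. The top-hat filters, the wavelength grids, and all function names are hypothetical, and AB colours are computed with the usual photon-counting convention.

```python
import math

def trapezoid(x, y):
    """Trapezoidal integral of y over x."""
    return sum(0.5 * (y[i] + y[i + 1]) * (x[i + 1] - x[i]) for i in range(len(x) - 1))

def interp(x, xp, fp):
    """Linear interpolation on an ascending grid xp; zero outside its range."""
    if x < xp[0] or x > xp[-1]:
        return 0.0
    for i in range(len(xp) - 1):
        if x <= xp[i + 1]:
            w = (x - xp[i]) / (xp[i + 1] - xp[i])
            return (1.0 - w) * fp[i] + w * fp[i + 1]
    return fp[-1]

def mean_fnu(wave_rest, flam_rest, filt_wave, filt_trans, z):
    """Filter-averaged f_nu (up to the constant 1/c) of an SED shifted to redshift z."""
    # Observed-frame SED on the filter grid: lambda_obs = lambda_rest * (1 + z)
    flam_obs = [interp(w / (1.0 + z), wave_rest, flam_rest) / (1.0 + z) for w in filt_wave]
    num = trapezoid(filt_wave, [f * t * w for f, t, w in zip(flam_obs, filt_trans, filt_wave)])
    den = trapezoid(filt_wave, [t / w for t, w in zip(filt_trans, filt_wave)])
    return num / den

def model_colour(wave_rest, flam_rest, filt_a, filt_b, z):
    """AB colour m_a - m_b of the redshifted template."""
    fa = mean_fnu(wave_rest, flam_rest, filt_a[0], filt_a[1], z)
    fb = mean_fnu(wave_rest, flam_rest, filt_b[0], filt_b[1], z)
    return -2.5 * math.log10(fa / fb)

def tophat(lo, hi, step=10.0):
    """Hypothetical top-hat filter between lo and hi (Angstrom)."""
    wave = [lo + i * step for i in range(int((hi - lo) / step) + 1)]
    return wave, [1.0] * len(wave)

# Toy template with constant f_nu (f_lambda proportional to lambda**-2)
wave = [1000.0 + 10.0 * i for i in range(1101)]
flat_fnu = [w ** -2 for w in wave]
band_a = tophat(4000.0, 5500.0)
band_b = tophat(5500.0, 7000.0)
colour_at_z = model_colour(wave, flat_fnu, band_a, band_b, 0.5)
```

A spectrum with constant $f_\nu$ has zero AB colour at every redshift, which makes the toy template a convenient sanity check for the integration.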

In Greisel et al. (2013, hereafter G13) we created model SEDs for
spectroscopically observed luminous red galaxies (LRGs) from the Sloan Digital
Sky Survey data release 7 (SDSS-II, York et al. 2000; Eisenstein et al. 2001;
Abazajian et al. 2009). These models were then used for the estimation of
photo-zs and yielded accurate results. The LRG sample of SDSS-II included LRGs
only up to redshift $z \sim 0.5$. In this paper we extend the work done in G13
and generate a set of model SEDs on the basis of the CMASS sample of the Baryon
Oscillation Spectroscopic Survey (BOSS, Eisenstein et al. 2011; Dawson et al.
2013; Ahn et al. 2014). These templates are created on the basis of the colours
of the CMASS galaxies. They are specifically designed and selected to produce
accurate photometric redshifts with template fitting techniques.

This paper is organized as follows. We present the template fitting photometric
redshift code used in this work in Sec. 2. In Sec. 3 the data used in this work
is described. We briefly explain the SED fitting routine in Sec. 4. After that,
we go into detail about the generation of model SEDs and how we modify them in
order to fit the data colours to a better extent. At the end of Sec. 4 we select
a model set which is to be used with template fitting photometric redshift
codes. We then present the photometric redshift results of our new models and
compare them with the two different photo-zs available in the SDSS database. We
also predict the colours of our models when fitted at the more accurate one of
the SDSS photo-zs in comparison to their prediction with the redshifts of this
work. Finally, we give a summary and conclusions.

Throughout this paper we assume a ΛCDM cosmology with $\Omega_m = 0.3$,
$\Omega_\Lambda = 0.7$ and $H_0 = 70\,\mathrm{km\,s^{-1}\,Mpc^{-1}}$. Magnitudes
are $\log_{10}$ magnitudes given in the AB system (Oke & Gunn 1983).


2 PHOTOMETRIC REDSHIFTS 

For photometric redshift estimation in this paper we use the Bayesian template
fitting code PhotoZ (Bender et al. 2001). Template fitting codes essentially
determine the photometric redshift by performing a maximum likelihood fit of the
predicted colours of a set of template SEDs to the observed galaxy colours at
varying redshifts. The predicted colours are calculated by multiplying the SEDs
with the survey filter functions and integrating. In order to lift degeneracies
in the colours of model SEDs, Bayesian codes include the possibility to assign
prior probabilities in redshift and luminosity to individual models. The
resulting probability of a model-redshift combination then reads


$$P(\mu \mid C, m) \propto \mathcal{L}(C, m \mid \mu) \cdot P(\mu \mid T),$$

where C and m are the colours and magnitudes of the photometric data, while
$\mu$ denotes the absolute magnitude M and the redshift z. The second factor is
the prior probability $P(\mu \mid T) = P_M(M \mid T) \cdot P_z(z \mid T)$, with
the probability distribution functions (PDFs) of M and z for a template T. In
the case of the PhotoZ code, the prior functions follow

$$P_x(x \mid T) \propto \exp\left( -\ln(2) \left( \frac{x - \mu_x}{\sigma_x} \right)^{p_x} \right),$$

with $x = z, M$. The parameters $\mu_x$, $\sigma_x$, and $p_x$ can be chosen for
each template separately, where $p_x$ is even. The individual photometric
redshift value of a galaxy is determined by the mode $z_\mathrm{mode}$ of the
P(z) distribution of the best fitting model. We estimate the error of the
individual photo-z prediction as



$$\delta z = \left[ \sum_{z} \left( z - z_\mathrm{best} \right)^2 \cdot P(z) \right]^{1/2}. \qquad (1)$$


Furthermore, we determine the stacked probability densities for all models in
the template set, which we will use later in the calculation of photo-z quality
metrics.
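As an illustration of the Bayesian step described in this section, the following sketch combines a likelihood with a redshift prior of the exponential form given above, takes the mode of the resulting P(z), and evaluates an error estimate of the kind used in the text. It is a toy version under stated assumptions and not the PhotoZ code itself. The Gaussian likelihood exp(-chi2/2), the normalisation of P(z) to unit sum, the parameter names `mu_z`, `sigma_z`, `p_z`, and the example chi-square curve are all assumptions made here.

```python
import math

def prior(x, mu_x, sigma_x, p_x):
    """Prior proportional to exp(-ln(2) * ((x - mu_x) / sigma_x)**p_x), with p_x an even integer."""
    return math.exp(-math.log(2.0) * ((x - mu_x) / sigma_x) ** p_x)

def posterior(z_grid, chi2, mu_z, sigma_z, p_z):
    """P(z) of one template on a redshift grid, normalised to unit sum.

    The likelihood is assumed to be exp(-chi2 / 2), i.e. Gaussian photometric errors.
    """
    chi2_min = min(chi2)  # subtract the minimum for numerical stability
    post = [math.exp(-0.5 * (c - chi2_min)) * prior(z, mu_z, sigma_z, p_z)
            for z, c in zip(z_grid, chi2)]
    norm = sum(post)
    return [p / norm for p in post]

def mode_and_error(z_grid, pz):
    """Mode of P(z) and the square root of sum((z - z_best)**2 * P(z))."""
    i_best = max(range(len(pz)), key=pz.__getitem__)
    z_best = z_grid[i_best]
    var = sum((z - z_best) ** 2 * p for z, p in zip(z_grid, pz))
    return z_best, math.sqrt(var)

# Hypothetical chi-square curve of one template, minimal at z = 0.62
z_grid = [0.40 + 0.01 * i for i in range(61)]
chi2 = [((z - 0.62) / 0.03) ** 2 for z in z_grid]
pz = posterior(z_grid, chi2, mu_z=0.6, sigma_z=0.3, p_z=6)
z_best, dz = mode_and_error(z_grid, pz)
```

With this functional form the prior drops to one half of its maximum at a distance of one `sigma_x` from `mu_x`, and a large even exponent yields an almost flat-topped prior with steep wings.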

The PhotoZ code has been applied in the past to a variety of photometric
catalogues (Drory et al. 2001; Gabasch et al. 2004; Drory et al. 2005; Feulner
et al. 2005; Brimioulle et al. 2008; Gabasch et al. 2008; Greisel et al. 2013;
Brimioulle et al. 2013; Gruen et al. 2013, 2014; Sanchez et al. 2014). It is
also part of the PS1 Photometric Classification Server (Saglia et al. 2012).


3 THE BOSS CMASS SAMPLE

In this paper we use the spectroscopic data from the Baryon Oscillation
Spectroscopic Survey (BOSS, Eisenstein et al. 2011; Dawson et al. 2013). The
latest public data release of BOSS (DR10, Ahn et al. 2014) targets 14,555 square
degrees of the sky, obtaining spectra for 1,848,851 galaxies. The two
spectrographs used in BOSS are rebuilt from the original SDSS spectrographs
(Smee et al. 2013) and cover a wavelength range of 3600 to 10,400 Å at a
resolving power of 1560 to 2650. BOSS was designed to measure the spatial
distribution of LRGs and quasars to investigate the baryon acoustic oscillations
that are imprinted on the large scale structure of today's universe. Galaxies
surveyed by BOSS have redshifts up to $z \sim 1.0$. The target selection is
discussed in Eisenstein et al. (2011). BOSS target galaxies are selected from
SDSS-II imaging data in such a way that they have high luminosities and masses.
Furthermore, the BOSS target selection requires approximately uniform stellar
masses throughout the redshift range of $0.2 < z < 0.6$. Above $z \sim 0.6$, the
BOSS sample is magnitude limited. Similar to the cuts of the SDSS-II LRG sample
(Eisenstein et al. 2001; Padmanabhan et al. 2005), a number of magnitude and
colour cuts are applied to ensure the above requirements are fulfilled. They are
chosen such that they track the colours of a passively evolving galaxy model
from Maraston et al. (2009, hereafter M09). The BOSS sample is divided into two
subsamples, a lower redshift sample tagged LOWZ at $z \le 0.4$, and a higher
redshift sample, dubbed CMASS for the constant mass requirement (Eisenstein et
al. 2011).

The SDSS-III data can be acquired from the SDSS CasJobs website.^1 For the
selection of our sample we employ the SDSS clean photometry flag. This flag
ensures that we do not have duplicates in our sample by removing multiple
detections on different frames. Also, objects with deblending problems are
removed, as well as the ones where more than 20 % of the PSF flux is
interpolated over (that is, only 80 % of the flux is actually detected). In
addition, we demand that objects were detected in the first pass (unbinned
image), are not saturated, and that a radial profile could be constructed.

We furthermore require that galaxies in our sample have spectroscopic redshifts
determined to a high accuracy. Therefore, we choose only objects for our
catalogue whose spec-z warning flag is equal to zero. We want to create models
specifically designed for galaxies at higher redshifts than in G13. Also, CMASS
galaxies are very sparse at redshifts $z > 0.8$, which is why we reduce the
sample to galaxies at $0.45 \le z \le 0.9$. The resulting catalogue then
contains 486,934 CMASS galaxies. The CasJobs SQL query used to acquire this
catalogue is given in the appendix.
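The catalogue-level cuts named in this section can be summarised in a few lines. The sketch below is not the CasJobs query of this work and covers only three of the criteria (clean photometry, zero spec-z warning flag, and the redshift range). The dictionary keys `clean`, `zwarning`, and `z`, as well as the toy catalogue, are hypothetical names chosen for illustration.

```python
def select_sample(rows, z_min=0.45, z_max=0.9):
    """Keep objects with clean photometry, a zero spec-z warning flag, and z_min <= z <= z_max."""
    return [r for r in rows
            if r["clean"] == 1 and r["zwarning"] == 0 and z_min <= r["z"] <= z_max]

# Toy catalogue with one object passing all cuts
catalogue = [
    {"id": 1, "clean": 1, "zwarning": 0, "z": 0.55},
    {"id": 2, "clean": 1, "zwarning": 4, "z": 0.60},  # flagged spec-z
    {"id": 3, "clean": 0, "zwarning": 0, "z": 0.70},  # photometry not clean
    {"id": 4, "clean": 1, "zwarning": 0, "z": 0.30},  # below the redshift range
]
selected = select_sample(catalogue)
```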

Fig. 1 shows the redshift distribution of the CMASS galaxies selected in this
way. In the upper panel we plot the density of the sample in $z_\mathrm{spec}$
versus absolute magnitude in the r band. $M_r$ is derived by fitting the LRG
models of G13 to the data at known spectroscopic redshifts. The lower panel of
Fig. 1 presents the frequency in spec-z, normalised to an integral of one.


3.1 Colours of SED Templates versus Colours of 
BOSS Galaxies 

In this section we compare the predicted colours of model SEDs to the colours of
the BOSS CMASS sample. In Fig. 2 we show the case of the M09 LRG model, which is
used in the definition of the colour cuts of the BOSS galaxy sample. This model
was created by M09 by adding a mass fraction of 3% of metal poor stars
($Z_\odot/200$) to a single stellar population (SSP) model with solar
metallicity from the Pickles (1998) stellar library in order to match the
$g - r$ and $r - i$ colours of SDSS-II LRGs. We sample the M09 model at ages of
1.0 and 8.0 Gyr and predict the colours while redshifting the SED (keeping the
age fixed). Additionally, we consider two passively evolving variations of the
M09 model with formation redshifts of $z_f = 1.0$ and 3.0. Since the M09 model
is available only at distinct ages, we have to interpolate between them to
determine the SEDs as a function of redshift. The passive evolution was computed
with the EzGal software (Mancone & Gonzalez 2012). M09 specifically created this
model to match the colours of the SDSS-II LRGs, and it fits their median colours
well (see also Fig. 5 in G13, or Figs. 1 and 2 in M09). This is also true for
the colours of the CMASS sample, where we find that the model with $z_f = 1.0$
fits the $u - g$, $g - r$, and $i - z$ colours best of the cases considered. In
the case of the $g - r$ and $i - z$ colours there are deviations from the data
for $z > 0.8$, but the sample size of the CMASS galaxies also decreases
significantly in this region. However, the predicted colours in $r - i$ are too
blue up to a redshift of $z \sim 0.72$ and lie outside the median uncertainties
of the data. Furthermore, the colour errors are too small to be responsible for
the spread in colour, so the spread in the colours of the data is due to the
galaxy population, not the photometric uncertainties.

^1 http://skyserver.sdss3.org/CasJobs/

Figure 1. Upper panel: Spectroscopic redshift versus absolute magnitude in the r
band derived from a fit of G13 LRG models to the CMASS catalogue of this work.
Lower panel: Normalized frequency in spectroscopic redshift of the catalogue.

Therefore, the data cannot be matched by the M09 model simultaneously in all
colours, not even when evolution effects are taken into account. The spread in
colour can be explained by differences in the ages of the galaxies, but could
also be due to different stellar populations, i.e., the metallicity and the
distribution of stellar ages.


4 NEW SED TEMPLATES 

We aim to create a set of templates that can be used for photometric redshift
estimation of galaxies with similar properties as the CMASS sample. To create
models specifically designed to match the colours of the BOSS data, we fit a
number of model SEDs to the data at their known spectroscopic redshift and
select from the best fitting models a subset that should represent the data in
terms of colours, while yielding accurate photometric redshifts.

Figure 2. In the upper panels of the four major panels the CMASS colours are
plotted as a function of redshift in grey, where the median is shown by a black
dashed line. The lower panels are normalised to the median colour and show the
residuals. Error bars show the median data errors in five redshift bins. On top
of that the predicted colours of the M09 model are plotted for several ages and
formation redshifts. The purple and blue lines show the M09 model at constant
ages of 1 and 8 Gyr. Furthermore, the green and red lines show a passively
evolving version of M09 at formation redshifts of $z_f = 1.0$ and 3.0.

We expect the galaxy population to vary as a function of redshift. In the
process of photo-z estimation with template fitting we can account for that by
assigning different redshift prior probabilities to individual model SEDs. One
could be tempted to assume that a very large number of SEDs with different
properties would match the data in question and yield reliable photo-z results
(while heavily increasing the computation time). However, this is not the case,
since one has to deal with degeneracies in colours resulting from different
galaxy properties, e.g., age and metallicity. Also, introducing peculiar
templates can degrade the P(z) of a galaxy. Therefore, one has to carefully
select a small set of templates able to match the galaxy catalogue in question.
In order to create a set of models that match the data at different redshifts we
fit a variety of model SEDs to the CMASS galaxies within four redshift bins. The
z bins are centred on $z = 0.5, 0.6, 0.7, 0.8$ (continuing the sequence from
G13) with interval widths of $\Delta z = 0.04$.

In the following we first give a short introduction to the SED fitting procedure
and turn afterwards to the description of the models we used as a basis to
construct new model SEDs. After that we explain how we select models to
represent the data that should then serve as a template set for photometric
redshift estimation.


4.1 Generating Model SEDs for Objects in the 
BOSS Catalogue by SED Fitting 

To fit model SEDs to the data we use the SED fitting software SEDfit (Drory,
Bender & Hopp 2004). The code fits a number of model SEDs to the colours of the
data by performing a maximum likelihood fit. In addition to the models, one can
define a burst model which is then fitted in superposition to the main component
at several mass fractions. Furthermore, the SEDfit code applies dust extinction
to both the main and the star burst component, following the extinction law of
Calzetti et al. (2000). We perform the SED fitting procedure in such a way that
signal-to-noise ratios smaller than 1 are considered upper limits.
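The superposition of an attenuated main component and an attenuated burst can be sketched as follows. This is not the SEDfit code. The polynomial is the published Calzetti et al. (2000) attenuation curve with $R_V = 4.05$, while the separate extinction values for the two components, the normalisation of both SEDs per unit stellar mass, and all function names are assumptions made for this illustration.

```python
R_V = 4.05  # total-to-selective attenuation of the Calzetti et al. (2000) law

def calzetti_k(wave_angstrom):
    """Calzetti et al. (2000) curve k(lambda), nominally valid for 0.12 to 2.2 micron."""
    lam = wave_angstrom * 1e-4  # Angstrom to micron
    if lam < 0.63:
        return 2.659 * (-2.156 + 1.509 / lam - 0.198 / lam**2 + 0.011 / lam**3) + R_V
    return 2.659 * (-1.857 + 1.040 / lam) + R_V

def attenuate(wave, flux, a_v):
    """Dim each flux point by A(lambda) = A_V * k(lambda) / R_V magnitudes."""
    return [f * 10.0 ** (-0.4 * a_v * calzetti_k(w) / R_V) for w, f in zip(wave, flux)]

def composite(wave, main, burst, burst_fraction, a_v_main, a_v_burst):
    """Mass-weighted sum of an attenuated main component and an attenuated burst.

    Both input SEDs are assumed to be normalised per unit stellar mass.
    """
    m = attenuate(wave, main, a_v_main)
    b = attenuate(wave, burst, a_v_burst)
    return [(1.0 - burst_fraction) * x + burst_fraction * y for x, y in zip(m, b)]

# Toy example: flat main SED plus a blue burst contributing 2% of the mass
wave = [3000.0, 5500.0, 9000.0]
main = [1.0, 1.0, 1.0]
burst = [5.0, 2.0, 1.0]
sed = composite(wave, main, burst, burst_fraction=0.02, a_v_main=0.5, a_v_burst=1.0)
```

In such a fit, the burst fraction and the extinction values would be stepped over a grid, and the combination with the highest likelihood would be kept for each galaxy.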

The 4000 Å break is the most significant feature in the spectra of red galaxies
and the reason why we can estimate photo-zs from broad band photometry to a high
accuracy. At redshifts populated by the CMASS galaxies the 4000 Å break lies
within the r band, so the g band is the bluest band needed to derive the
position of the break in wavelength. The u band is very shallow, and it may
degrade the quality of the SED fitting results and the photometric redshifts if
the errors are not determined accurately. For these reasons we decide to omit it
in the following SED fitting and photometric redshift estimation. Furthermore,
because of the high computation time, the SED fits are not performed on the
whole data of a z bin. Instead, we randomly select subsamples of ~ 2000 objects
from within each z bin.
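The binning and subsampling step can be written down compactly. The sketch below uses the four bin centres and the bin width of $\Delta z = 0.04$ quoted earlier in this section. The function name, the fixed random seed, and the toy redshift list are hypothetical.

```python
import random

BIN_CENTRES = (0.5, 0.6, 0.7, 0.8)
BIN_WIDTH = 0.04

def subsample_bins(redshifts, n_per_bin=2000, seed=0):
    """Return, for each bin centre, the indices of a random subsample of objects in that bin."""
    rng = random.Random(seed)  # fixed seed so that the selection is reproducible
    picks = {}
    for zc in BIN_CENTRES:
        lo, hi = zc - BIN_WIDTH / 2.0, zc + BIN_WIDTH / 2.0
        in_bin = [i for i, z in enumerate(redshifts) if lo <= z <= hi]
        picks[zc] = rng.sample(in_bin, min(n_per_bin, len(in_bin)))
    return picks

# Toy catalogue: redshifts evenly spread over 0.45 <= z <= 0.9
redshifts = [0.45 + 0.0001 * i for i in range(4501)]
picks = subsample_bins(redshifts, n_per_bin=100)
```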


4.1.1 SED Fitting Results with BC03 Models


We tried a variety of available model SEDs in the SED fitting procedure. The
most extensive public libraries originate from Bruzual & Charlot (2003,
hereafter BC03), Maraston (1998, 2005, hereafter M05), and Maraston & Strömbäck
(2011, hereafter M11). We fit all of these models in separate trial runs in
order to select a basic model set to proceed further, since large sets of models
require enormous computation times.

The BC03 models can be generated by the software GALAXEV.^2 We create synthetic
models from the BaSeL 3.1 library, using the Padova 1994 evolutionary tracks
recommended by BC03 and the initial mass function (IMF) by Chabrier (2003). We
generate models with four different metallicities, $Z = 0.2, 0.4, 1$, and
$2.5\,Z_\odot$. From these we produce SSP models, as well as composite stellar
population (CSP) models. SSPs assume that all stars are formed instantaneously
in a delta-function star burst at the birth of the galaxy (age zero) and evolve
passively afterwards. To create CSPs one can assign essentially any function for
the star formation history (SFH), such that star formation takes place over
longer periods of time. Usually, a star formation rate proportional to
$\exp(-t/\tau)$ is assumed (at least for galaxies at lower redshifts $z < 1$),
where t is the age of the galaxy and $\tau$ is the e-folding time scale (e.g.,
Shapley et al. 2005; Longhetti & Saracco 2009). We produce models with different
$\tau$ values, $\tau = 0.5, 1.0, 3.0, 5.0, 8.0, 20, 50$ Gyr, where the latter
simulate an almost constant star formation rate (SFR). We sample the models at
ages ranging from 10 Myr to 13 Gyr. In order to exploit the maximum freedom
available, we also create model SEDs with increasing SFR, hence $\tau < 0$. This
kind of SFH is mostly important for high(er) redshifts ($z > 2$), where it
yields more physical results (e.g., Maraston et al. 2010; Monna et al. 2014),
but was used for SED fitting also at lower redshifts (e.g., Pforr, Maraston &
Tonini 2012). In our case the increasing SFR models do not significantly
increase the range in colour space needed to match the CMASS galaxies, which is
why we omit them in the following to save computation time.
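To make the role of the e-folding time scale concrete, the following sketch evaluates the exponential star formation rate and the fraction of stars formed up to a given time for some of the $\tau$ values listed above. It is only an illustration of the functional form and not part of GALAXEV. The function names are hypothetical, and mass returned to the interstellar medium is ignored.

```python
import math

TAUS_GYR = (0.5, 1.0, 3.0, 5.0, 8.0, 20.0, 50.0)

def sfr(t, tau):
    """Unnormalised star formation rate exp(-t / tau); tau < 0 gives a rising SFR."""
    return math.exp(-t / tau)

def formed_fraction(t, age, tau):
    """Fraction of all stars formed until `age` that had already formed by time t <= age.

    Follows from integrating exp(-t / tau); valid for positive and negative tau.
    """
    return math.expm1(-t / tau) / math.expm1(-age / tau)

# Half-way through a 10 Gyr history:
early = formed_fraction(5.0, 10.0, 0.5)     # short tau: nearly all stars are in place
steady = formed_fraction(5.0, 10.0, 50.0)   # long tau: close to one half
rising = formed_fraction(5.0, 10.0, -1.0)   # negative tau: most stars form late
```

For $\tau = 50$ Gyr the SFR at an age of 13 Gyr is still about 77% of its initial value, which is why such models approximate a constant star formation rate.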

We create the same variety of CSPs for the M05 and M11 models (at least where
possible) using the stellar population synthesis code EzGal (Mancone & Gonzalez
2012), and fit them to the data as well. We show the distribution in likelihoods
in Fig. A1 in App. A for the SED fits of BC03, M05, and M11 models to the CMASS
data in the four redshift bins. The BC03 models outperform the M05 and M11
models in terms of their $\chi^2$ values marginally at most. Fig. A1 shows that
the fits with BC03 models have a slightly higher frequency of lower $\chi^2$
values than M05/11. Furthermore, considering Fig. 20 in G13, the BC03 models are
a better match to the SDSS-II LRG data in terms of $\chi^2$ values (this is not
so evident here, as seen in Fig. A1). Because of the versatility in the model
creation with GALAXEV, and the variety of provided ages and metallicities, we
choose to use the BC03 models in the remainder of this work.

^2 http://www.cida.ve/~bruzual/bc2003 or http://www.iap.fr/~charlot/bc2003

Figure 3. Colour-colour plots for CMASS galaxies at $0.48 \le z \le 0.52$. The
BOSS data is shown by grey shades and accompanying grey contours. Error bars
denote the median error in colour in this redshift bin. The density of the
resulting colours of the SED fitted unmodified BC03 models is shown by coloured
contours. Both sets of contours are drawn at the same frequency levels, 5 %,
25 %, 45 %, and 65 %.


In Figs. 3 to 6 we present the SED fitting results with BC03 SSP and CSP models
with increasing and decreasing SFRs in colour space. The data colours are
plotted in grey scales with corresponding contours, and the median colour errors
are shown by error bars in each panel. The density of the predicted colours of
the best fitting models is plotted over the data distribution in coloured
contours. Both sets of contours are plotted at the same frequency levels.

The bluer $i - z$ colour range populated by the CMASS galaxies in the redshift
bins at $z \sim 0.5$, $z \sim 0.6$ and $z \sim 0.7$ is not populated by the best
fitting BC03 models. Additionally, the redder $g - r$ colours at $z \sim 0.7$
are also not matched by the BC03 models. All these offsets in colour cannot be
accounted for by the photometric errors, which are also given in Figs. 3 to 6.
We investigate the colour mismatch in the next section and modify the model SEDs
to fit the data to a better extent.
Figure 8. Colour-colour plots of the CMASS data and model SEDs. The BOSS data is
split into four equally sized redshift bins. From these bins we calculate the
mean colour and plot it as points in the panel. The error bars denote the
accompanying root mean square values of the colour in each bin. The points and
the error bars are colour-coded with redshift (see the colour bar on the right,
where white lines indicate the mean redshift in each bin). The red shaded track
shows the colours of the model SED of Fig. 7 with varying redshift, where the
crosses are at the same z as the data. Again, the redshift of the track is
colour coded and shown in the colour bar. The green and blue shaded tracks
originate from the same model, when multiplied by $\lambda^{-\beta}$ with
$\beta = 0.5, 1$ for $\lambda \ge 3000$ Å. A more detailed description is given
in the text.


4.1.2 Modification of the Red Continuum Slope

In G13 we already showed in Fig. 19 that the predicted $i - z$ colour of the
SDSS-II LRG sample cannot be matched by the models for the highest redshift bin
at $z \sim 0.4$, at least not while the other three colours are fitted
simultaneously. The model colours were too red in comparison to the data, which
means that the decrease in the continuum slope redwards of the 4000 Å break of
the model SEDs is not strong enough. The mismatch in $i - z$ is also present for
the CMASS sample, which is located at even higher redshifts than the SDSS-II
LRGs.

The slope of the continuum is changed by variations in the physical properties
of the models, which we investigate in the following paragraph. It is unlikely
that the choice of the IMF could affect the model SEDs in such a way as to
produce the colour mismatch we observe. Changing the Chabrier IMF to a Salpeter
(1955) or Kroupa (2001) IMF would only result in a change of the abundance of
low mass stars. These should not have a great impact on the continuum slope,
since the light in this part of the spectrum is dominated by red giants. Optical
colours are not sensitive to the choice of the IMF, which is (when stars are not
resolved) often derived from spectroscopy or IR photometry (e.g., Genarro et al.
2003; Meidt et al. 2012; Conroy & van Dokkum 2012). In turn, the choice of the
particular IMF cannot cause the observed mismatch in colours. Different
metallicities change the slope in the continuum as well, but we have considered
sub- to super-solar values. Furthermore, we have exploited the model age and
extinction as further parameters up to still reasonable values, i.e., 13 Gyr and
$A_V = 3.0$. The burst component significantly affects only the bluer parts of
the spectrum and has only marginal impact on the red part, which is dominated by
the main stellar population, and not by the small (in total mass) burst
fractions. Lastly, the SFH has to be considered. We analysed CSPs with nearly
constant SFR and SSPs which create all their stars at one instant in time. We
even considered exponentially increasing SFRs. If the too shallow red continuum
slope were a result of a poorly chosen SFH, we would at least expect the data to
be bracketed by the considered models in colour space.

Since we cannot isolate physical model parameters that cause the colour
mismatch, we can only assume that the issue arises due to inaccuracies in the
modelling of the stellar evolution phase most difficult to follow theoretically,
i.e., the asymptotic giant branch (AGB). The wavelength range covered by the
colour mismatch hints at an incorrect modelling of AGB stars, since these
dominate the SED in the red optical and NIR parts of the spectrum. E.g., Zibetti
et al. (2013) show through NIR spectroscopy (comparing BC03 and M05 models in
their Fig. 1) that the chemical composition influences the slope of the model
SEDs in the NIR, i.e., that these differences are caused by different C/O
abundances. Furthermore, the different composition of dust in the circumstellar
envelopes of carbon and oxygen-rich AGB stars affects the efficiency of dust
absorption and emission, which can therefore greatly influence the colours
(e.g., Marigo et al. 2008; Cassarà et al. 2013; Salaris et al. 2014). Other
explanations could be that the simple mixing-length theory models describing the
convection in stars in the AGB phase are insufficient to reproduce the colours,
or that incorrect assumptions on mass loss lead to the deviation.

Figure 4. Colour-colour plots for CMASS galaxies and best fitting BC03 models at
$0.58 \le z \le 0.62$. See Fig. 3 for a detailed description.

Figure 5. Colour-colour plots for CMASS galaxies and best fitting BC03 models at
$0.68 \le z \le 0.72$. See Fig. 3 for a detailed description.

Figure 6. Colour-colour plots for CMASS galaxies and best fitting BC03 models at
$0.78 \le z \le 0.82$. See Fig. 3 for a detailed description. The contours had
to be smoothed to make them visible due to the small sample size in this
redshift bin.

A close investigation of the modelling of the AGB phase is needed to resolve the
problem, which is beyond the scope of this paper. However, we explain in the
next paragraph how we modify the red continuum slope to better match the data.

The red SED continuum follows a function proportional to $\lambda^{-\beta}$,
heavily modified by absorption lines. Therefore, we can also change the
steepness of the continuum by varying $\beta$. This is done in such a way that
we multiply the SED redwards of a wavelength $\lambda_i$ with
$a \cdot \lambda^{-\beta}$, where a is chosen such that the fluxes of the
underlying model SED and the modified SED coincide at $\lambda_i$. In this way
we can change the continuum slope for $\lambda \ge \lambda_i$, so that the
created SED on average follows a correspondingly steeper power law. We will term
the models modified in this way “$\lambda^{-\beta}$ models” in the remainder of
the text.

As an example, we show in Fig. 7 the resulting SEDs of an LRG model from G13
when modified by $a \cdot \lambda^{-\beta}$ for $\lambda \ge \lambda_i = 4000$ Å
and $\beta = 0.5, 1.0, 1.5$. We also plot the SDSS filters as positioned in
wavelength in the observed frame at $z = 0.5$ and $z = 0.9$, the approximate
extreme redshift values of the CMASS sample. We can see that for
$\lambda \ge 4000$ Å the change in slope mostly affects the i and z bands at
lower redshifts. For higher redshifts $\lambda_i = 4000$ Å has moved into the r
band. Therefore, we create variations of the BC03 SSP models and the CSP models
with decreasing SFR explained above, with
$\lambda_i \in \{3000, 3500, 4000, 4500, 5000, 5500, 6000\}$ Å, such that the
variation affects the fluxes in different filters for similar redshifts.
Furthermore, the $\beta$ values span a range from 0.5 to 2.0 with a step size of
0.5. Larger ranges for values of $\lambda_i$ and/or $\beta$ do not improve the
results further, since the location in colour spaces they would inhabit is
already covered by the other models or not populated by the CMASS galaxies.

Figure 7. The black line is an LRG model SED from G13. The dotted blue, green,
and red lines show the function $\lambda^{-\beta}$ with $\beta = 0.5, 1.0, 1.5$
for $\lambda \ge 4000$ Å. The solid lines are the above SED multiplied by
$\lambda^{-\beta}$, also for $\lambda \ge 4000$ Å. In grey (dark grey dashed
line, and light grey solid line) the SDSS ugriz filter curves are shown for a
galaxy at $z = 0.5$ and $z = 0.8$, respectively.
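The slope modification can be written in a few lines. In the sketch below the normalisation is $a = \lambda_i^{\beta}$, which follows from the requirement in the text that the original and the modified SED coincide at $\lambda_i$. The function name and the toy SED are hypothetical, while the grid values are the ones quoted above.

```python
def modify_red_slope(wave, flux, lambda_i, beta):
    """Multiply the SED by a * lambda**(-beta) for lambda >= lambda_i.

    With a = lambda_i**beta the factor equals (lambda / lambda_i)**(-beta),
    so the modified SED is continuous at lambda_i.
    """
    out = []
    for w, f in zip(wave, flux):
        if w >= lambda_i:
            f = f * (w / lambda_i) ** (-beta)
        out.append(f)
    return out

# Grid of pivot wavelengths (Angstrom) and slopes quoted in the text
LAMBDA_I = [3000.0 + 500.0 * k for k in range(7)]
BETAS = [0.5, 1.0, 1.5, 2.0]
GRID = [(lam_i, beta) for lam_i in LAMBDA_I for beta in BETAS]

# Toy SED: a flat spectrum modified redwards of 4000 Angstrom with beta = 1
wave = [2000.0, 4000.0, 8000.0]
flux = [1.0, 1.0, 1.0]
modified = modify_red_slope(wave, flux, 4000.0, 1.0)
```

Each base model thus acquires 28 modified variants in addition to its unmodified version, which illustrates why the later selection of a small template subset is needed to keep the photo-z estimation tractable.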

In Fig. 8 we show the colours of the CMASS data (grey). We furthermore plot the
colours of an exemplary SED from G13 (red), and the colours of the same model
when modified by $\lambda^{-\beta}$ for $\beta = 0.5$ and 1 (green and blue).
The data in Fig. 8 is split into four redshift bins, and we plot the mean colour
and corresponding root mean square values of each bin. The points are
colour-coded in such a way that they darken with increasing redshift, which is
shown by the grey colour bar on the right of Fig. 8, where white lines denote
the mean redshifts within the bins. On top of the data, the colour tracks as
functions of redshift are plotted for the models. The model colours at the z
values of the data are highlighted by crosses, and the tracks in the colour
spaces are again colour-coded as a function of redshift. The track of the
original model is plotted by varying intensities of red, whereas those of the
corresponding $\lambda^{-\beta}$ models are plotted in green shades for
$\beta = 0.5$, and blue shades for $\beta = 1$. All plotted models have
$\lambda_i = 3000$ Å. While the data colours (grey) are well matched by the
original model (red track) at lower redshifts, the deviations from the mean
colour increase with increasing z. The modification by $\lambda^{-\beta}$
produces a better agreement with the mean data colours. The value of $\beta$
necessary to fit the data best is somewhat ambiguous in this plot, and has to be
decided individually for each galaxy together with the best fitting model.


4.1.3 SED Fitting Results with Modified BC03 Models

We now introduce the modified versions of the BC03 models
together with the original SSPs and CSPs with decreasing
SFHs from Sec. 4.1.1 into the SEDfit code, with the same
fitting parameters as in Sec. 4.1.1. Figs. 9 through 12 show
the SED fitting results in colour spaces of the CMASS data
in the four z bins. We see that the contours of the data and
the best fitting models (which are drawn at the same frequency
levels) almost coincide, for instance in Fig. 10.

Comparing Figs. 9 to 12 with the corresponding figures for the
original BC03 models, we can observe an improvement in the match
between the locations in colour space populated by the best fitting
models and
Figure 9. Colour-colour plots for CMASS galaxies at $0.48 < z < 0.52$
(analogous to the corresponding earlier figure). The BOSS data are
shown by grey shades and accompanying grey contours. Error bars
denote the median error in colour in this redshift bin. The density
of the resulting colours of the best fitting BC03 and BC03
$\lambda^{-\beta}$ models is shown by coloured contours. Both sets of
contours are drawn at the same frequency levels, 5%, 25%, 45%, and 65%.

Figure 10. Colour-colour plots for CMASS galaxies and best
fitting BC03 $\lambda^{-\beta}$ models at $0.58 < z < 0.62$ (analogous to
the corresponding earlier figure). See Fig. 9 for a detailed
explanation of the plot.

Figure 11. Colour-colour plots for CMASS galaxies and best
fitting BC03 $\lambda^{-\beta}$ models at $0.68 < z < 0.72$ (analogous to
the corresponding earlier figure). See Fig. 9 for a detailed
explanation of the plot.


the data. This is additionally confirmed by Fig. A1 in
App. A, where we see that the resulting $\chi^2$ values of the fits
are more frequently found at lower values for the BC03
$\lambda^{-\beta}$ models than for the original ones, hence improving
the goodness of fit. Here, we want to specifically analyse
the offsets in the $g - r$, $r - i$, $i - z$, and $g - z$ colours of
the best fitting BC03 models and their modified variations
from the data. We show in Figs. 13 to 16 the deviations
of the model colours, predicted by the best fitting SEDs,
from the data,
$\Delta\mathrm{col} = \mathrm{col}_\mathrm{model} - \mathrm{col}_\mathrm{data}$,
for both setups and in all four considered z bins. The distribution
in $\Delta\mathrm{col}$ of the original BC03 models (SSPs, as well as
decreasing and increasing SFR CSPs) is presented by red histograms,
where the Gaussian curve best fitting the histogram is
shown by a dashed orange line. The colour offsets of the
BC03 $\lambda^{-\beta}$ models are plotted by a grey histogram, with the
corresponding Gaussian drawn in black. $\Delta\mathrm{col} = 0$ is shown
by a dashed black line, and we print the parameter values of the
Gaussians in the plots.
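The colour offsets defined above are simple per-galaxy differences. The following Python sketch computes $\Delta\mathrm{col}$ and characterises its distribution by the sample mean and standard deviation, which is used here as a simple stand-in for the Gaussian fit to the histogram and is not the fitting procedure of this work. The colour values are hypothetical and serve only as an illustration.

```python
from statistics import mean, stdev


def colour_offsets(model_colours, data_colours):
    """Return Delta col = col_model - col_data for each galaxy."""
    return [m - d for m, d in zip(model_colours, data_colours)]


# Hypothetical r - i colours of five galaxies (illustrative numbers only).
data_r_i = [0.90, 0.95, 1.00, 0.85, 0.92]
model_r_i = [0.91, 0.93, 1.02, 0.86, 0.90]

delta_col = colour_offsets(model_r_i, data_r_i)
mu = mean(delta_col)      # moment estimate of the Gaussian centre
sigma = stdev(delta_col)  # moment estimate of the Gaussian width
```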

While the mean deviations presented in Figs. 13 to 16 are
more or less the same in all considered redshift bins, the
standard deviation of $\Delta\mathrm{col}$ is about 2 to 3 times higher
for the original BC03 SEDs. Furthermore, the BC03 $\lambda^{-\beta}$
distributions of $\Delta\mathrm{col}$ resemble a Gaussian much more
closely than those of the original BC03 models, especially in the
$r - i$ and $i - z$ colours, but also in $g - z$. We already pointed out
that the $i - z$ colour is overestimated by the BC03 models
analysed in G13 for $z \sim 0.4$. This is still true for $z \approx 0.5$
to 0.7, observable in Figs. 13 to 15 (cf. Figs. 9 to 11).
Concurrently, the predicted $r - i$ colours of the unmodified
BC03 models are too blue up to $z \sim 0.6$, and too red at
$z \sim 0.7$. In contrast, the colours of the BC03 $\lambda^{-\beta}$
SEDs yield very symmetric distributions, although with
enhanced flanks in comparison to a Gaussian for higher



Figure 12. Colour-colour plots for CMASS galaxies and best
fitting BC03 $\lambda^{-\beta}$ models at $0.78 < z < 0.82$ (analogous to
the corresponding earlier figure). See Fig. 9 for a detailed
explanation of the plot. The contours had
to be smoothed to make them visible due to the small sample size 
in this redshift bin. This is done in the same way as in Fig. 


redshifts. 

In summary, we investigated two additional degrees of
freedom, apart from the model SED parameters metallicity,
SFH, age, burst, and dust, by modifying the red
continuum slope through multiplication with
$a \cdot \lambda^{-\beta}$ at $\lambda \geq \lambda_i$, to
match the colours of the CMASS galaxies to a better extent.
When fitted to the data, the $\lambda^{-\beta}$ SEDs predict colours that
deviate less from the data than the unmodified models they
originate from, and yield lower $\chi^2$ values. We will use the
variety of best fitting SEDs (including the additional burst
and dust components) as a basis to select from when we
define a model set for photometric redshift template fitting
codes.


4.2 Selection of Best Fitting SEDs for the New 
Template Set 

We want to select a set of model SEDs from the best fitting
models of the previous section that represent the data in
terms of colours for each redshift bin. The space we can
construct from the SDSS colours is many-dimensional, and
we want to reduce the dimensions for simplicity of the
selection process. Therefore, we decided that the target space
should have two dimensions, a compromise between the loss
of information (which is greater for fewer dimensions) and
the simplification through the reduction of dimensions. We
could, in principle, perform a principal component analysis
(PCA), and reduce the dimensions by concentrating on the
space which is spanned by the first two eigenvectors of the
PCA that have the highest variance. Another possibility
for the reduction of dimensions is a self-organising map
(SOM or Kohonen map; Kohonen 1982, 2001). A SOM is
Figure 13. Deviations in $g - r$, $r - i$, $i - z$ and $g - z$ colours for BC03 models and BC03 $\lambda^{-\beta}$ models from the data within $0.48 \leq z \leq 0.52$.
The $\Delta\mathrm{col}$ distribution for the BC03 $\lambda^{-\beta}$ models is plotted in grey, and the best fitting Gaussian is indicated by a solid black line. The
same distribution, but for unmodified BC03 models, is given by a dark red line, where we plot the best fitting Gaussian by a dashed
orange line. The black dashed vertical line highlights $\Delta\mathrm{col} = 0$. The best fitting parameters of the Gaussian curves are given in the plots.


an artificial neural network (ANN) that provides a discrete
representation of a set of higher dimensional data values
in a lower dimensional space (most often two-dimensional).
The network is trained using unsupervised learning to map
the data onto the surface in such a way that data points
with similar properties (i.e., data values) are located in
close neighbourhoods. Unlike a PCA, where neglecting
the third and higher components yields a complete loss
of the information carried by them, the two-dimensional
SOM still retains this information in its points. This is
why we chose a SOM over a PCA for the mapping onto a
two-dimensional plane in the model selection below.

We create maps of the galaxy catalogues from the $g - r$,
$r - i$, and $i - z$ colours of the four considered redshift bins.
Furthermore, we can easily introduce the absolute
magnitudes M of the galaxies as a fourth quantity to be
mapped, since M can also hold information about the
colour (e.g., Baldry et al. 2004). We have to take care
of the errors in the data by normalising the colours to
their mean value and dividing by the colour errors
determined by the uncertainties in the photometry. The
absolute magnitudes of the galaxies are calculated by
fitting the LRG model SEDs of G13 to the data at their
spectroscopic redshifts. From the best fitting models we
extract the absolute magnitude in the SDSS r band, $M_r$, which
has to be normalised to a standard normal distribution to
be comparable to the colour values. Once this is done we
train the SOM and create a surface with 30 x 30 discrete x-
and y-values with assigned input values (i.e., colours and
$M_r$). The positions of data points in the SOM are then
determined by performing a nearest neighbour search.
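To make the mapping procedure more concrete, the following self-contained Python sketch trains a small SOM and places data points on it by a nearest neighbour search. It is a generic textbook SOM under stated assumptions, not the implementation used in this work: the linear decay of the learning rate and of the neighbourhood radius, the 5 x 5 map size, and the toy input data are all illustrative choices, and `normalise` assumes that "normalising to the mean" means subtracting the mean before dividing by the errors.

```python
import math
import random


def normalise(values, errors):
    """Subtract the mean and divide by the per-object error (assumed form)."""
    centre = sum(values) / len(values)
    return [(v - centre) / e for v, e in zip(values, errors)]


def best_matching_unit(weights, sample):
    """Nearest neighbour search: node whose weight vector is closest."""
    return min(weights, key=lambda node: math.dist(weights[node], sample))


def train_som(data, size=5, n_iter=500, seed=0):
    """Train a size x size SOM with a Gaussian neighbourhood function."""
    rng = random.Random(seed)
    dim = len(data[0])
    weights = {(x, y): [rng.uniform(-1, 1) for _ in range(dim)]
               for x in range(size) for y in range(size)}
    for step in range(n_iter):
        frac = 1.0 - step / n_iter
        rate = 0.5 * frac                   # learning rate decays linearly
        radius = max(1.0, size / 2 * frac)  # neighbourhood shrinks with time
        sample = rng.choice(data)
        bx, by = best_matching_unit(weights, sample)
        for (x, y), w in weights.items():
            grid_dist2 = (x - bx) ** 2 + (y - by) ** 2
            influence = math.exp(-grid_dist2 / (2 * radius ** 2))
            for k in range(dim):
                w[k] += rate * influence * (sample[k] - w[k])
    return weights


# Two toy groups in a 4D input space (three colours and M_r, normalised).
rng = random.Random(1)
group_a = [[rng.gauss(-1, 0.1) for _ in range(4)] for _ in range(20)]
group_b = [[rng.gauss(+1, 0.1) for _ in range(4)] for _ in range(20)]
som = train_som(group_a + group_b)
node_a = best_matching_unit(som, group_a[0])
node_b = best_matching_unit(som, group_b[0])
```

In this toy case, objects from the two groups are expected to land on different map nodes, which is the property exploited by the cluster search on the map.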

We use a SOM algorithm implemented in python and 
provided in the PyMVPA package (Hanke et al. 2009). 


Fig. 17 presents the SOM trained on the catalogue with
$0.48 \leq z \leq 0.52$ on the $g - r$, $r - i$, and $i - z$ colours and
on $M_r$. The four panels are representations of the same
SOM, but with the values of the four mapped parameters
($g - r$, $r - i$, and $i - z$ colours and $M_r$) encoded in colour.
The density of the mapped underlying data is shown by
contours. We can see from Fig. 17 that the extreme values
of $g - r$ and $r - i$ have the greatest separation in the map,
which is due to these colours having the largest spread in
values. This is not a result of measurement errors, since
we took these into account through the normalisation of
the colours explained above. We can see, for
example, that the dense region in the middle of the panels
Figure 14. Deviations in $g - r$, $r - i$, $i - z$ and $g - z$ colours for BC03 models and BC03 $\lambda^{-\beta}$ models from the data within $0.58 \leq z \leq 0.62$.
A detailed description of the plot is given in Fig. 13.


corresponds to a higher concentration of the data at the
respective colour values. These are $g - r \approx 1.7$, $r - i \approx 0.9$,
and $i - z \approx 0.45$, which also represent the areas of highest
density in the colour-colour plots of this redshift bin (cf. Fig. 9).


As previously mentioned, we aim to select a sample of
model SEDs from the best fitting models of the last section,
where we want to take these density variations in the
population of the data into account. Therefore, we want to
identify clusters in the mapped data to select a model from
each cluster that should represent the galaxies within the
same cluster cell in terms of colour and absolute r band
magnitude. To perform the cluster search we employ a k-means
clustering algorithm (Steinhaus 1957) that partitions
the two-dimensional maps into cluster cells. Each cell is a
Voronoi bin (Voronoi 1908), where two bins or clusters are
separated from one another by a border orthogonal to the
line connecting the cluster centres. Every data point belongs
to the cluster with the nearest centre. Specifically, we
use the Python k-means clustering algorithm included in the
scikit-learn package (Pedregosa et al. 2011).

Fig. 18 presents the bins of the k-means clustering algorithm
with k = 6 clusters applied to the SOM of Fig. 17, where the
cluster centres are indicated by black crosses. The density of
the data in the SOM is shown by contours in the plot. The
data exhibit a number of clusters in all redshift bins which
does not exceed six, which is why k = 6 was chosen.
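The Voronoi property of the cluster cells, i.e., that every point belongs to the nearest centre, can be sketched in a few lines of Python. The block below is a plain Lloyd iteration written for illustration; it is a stand-in for, not a reproduction of, the scikit-learn implementation used in this work. The initial centres are passed explicitly, and the toy map positions are hypothetical.

```python
import math


def assign(points, centres):
    """Voronoi assignment: index of the nearest centre for every point."""
    return [min(range(len(centres)), key=lambda j: math.dist(p, centres[j]))
            for p in points]


def k_means(points, centres, n_iter=50):
    """Lloyd iteration starting from explicitly given initial centres."""
    centres = list(centres)
    for _ in range(n_iter):
        labels = assign(points, centres)
        for j in range(len(centres)):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old centre if a cell runs empty
                centres[j] = tuple(sum(c) / len(members)
                                   for c in zip(*members))
    return centres, assign(points, centres)


# Toy positions on a two-dimensional map: two well separated groups.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0),
          (10.0, 10.0), (10.0, 11.0), (11.0, 10.0), (11.0, 11.0)]
centres, labels = k_means(points, centres=[(0.0, 0.0), (11.0, 11.0)])
```

With these inputs the two centres converge to the means of the two groups, (0.5, 0.5) and (10.5, 10.5).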

We want to select a set of SEDs from the best fitting model
SEDs which we will use in the following as template set for
photo-zs. Therefore, for each cluster bin we take all models
into account whose corresponding data points also lie
in the same bin. Then, for each of these models separately,
we estimate photometric redshifts for all objects within the
cell and determine the resulting quality parameters, i.e., mean
error, scatter, and catastrophic failures of the photo-zs. For
each cluster we choose the model that yields the best photo-z
results in terms of these parameters. Thereby, we have
six models for each of the four redshift bins, hence 24 models in
total. The SEDs selected in this way will be used in the following
to estimate photo-zs.
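The selection step within one cluster cell can be sketched as follows. The text does not specify how the three quality parameters are combined, so this Python sketch assumes a lexicographic ranking by outlier fraction, then scatter, then absolute bias; the outlier limit of 0.15 is the one used in Sec. 5.1. Model names and redshift values are hypothetical.

```python
from statistics import mean, pstdev


def quality(z_phot, z_spec, outlier_limit=0.15):
    """Return (outlier fraction, scatter, |bias|) of rest-frame errors."""
    dz = [(zp - zs) / (1 + zs) for zp, zs in zip(z_phot, z_spec)]
    outliers = sum(abs(d) >= outlier_limit for d in dz) / len(dz)
    return outliers, pstdev(dz), abs(mean(dz))


def select_template(results, z_spec):
    """Pick the model whose quality tuple is smallest (assumed ranking)."""
    return min(results, key=lambda name: quality(results[name], z_spec))


# Hypothetical photo-z results of two candidate models in one cluster cell.
z_spec = [0.50, 0.51, 0.49, 0.52]
results = {
    "model_a": [0.51, 0.50, 0.50, 0.53],
    "model_b": [0.50, 0.80, 0.49, 0.52],  # one catastrophic failure
}
best = select_template(results, z_spec)
```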


5 PHOTOMETRIC REDSHIFTS 

In this section we analyse the photometric redshift results
we get using our novel templates in combination with the
PhotoZ code on the whole CMASS sample detailed above.
Afterwards, we compare our photo-zs with the photometric
redshifts provided by the SDSS database.
Figure 15. Deviations in $g - r$, $r - i$, $i - z$ and $g - z$ colours for BC03 models and BC03 $\lambda^{-\beta}$ models from the data within $0.68 \leq z \leq 0.72$.
A detailed description of the plot is given in Fig. 13.


5.1 Photometric Redshifts with the Novel 
Template SEDs 

To introduce the models created in the previous section into
the PhotoZ code, we define the z prior such that $\mu_z$ of a
model SED is the central value of the z bin of the catalogue
from which the model originates, so that
$\mu_z = 0.5, 0.6, 0.7, 0.8$. For $\sigma_z$, we set a default value of
0.2, which leads to a prior function $P_z(z|T)$ wide enough to avoid
focusing effects at certain redshift bin centres, while ensuring
a smooth transition between them. The resolution in redshift
of a photo-z run is 0.002 in the range $z \in [0.0, 3.0]$.
The allowed redshift range is much wider than that which is
populated by the CMASS galaxies, such that we can analyse
whether the photo-z accuracy is diminished by $z_\mathrm{phot}$ values that
are highly over- or underestimated. This is done because we
would like to be able to run photo-z codes with the new models
on catalogues with galaxies from larger redshift ranges
and with more variations in SED type. We therefore want
to make sure that small errors in $z_\mathrm{phot}$ are not due to a
restriction in the redshift range.

To improve the priors by adapting them iteratively, we analyse
the outcome for subsamples of objects which are fitted
best by a specific model. Thereby we can adjust the redshift
and luminosity priors for each model SED in order to reduce
outliers and bias. Essentially, we decrease the value of $\sigma_z$
whenever a model yields lower $z_\mathrm{phot}$ accuracies for redshifts
further away from its bin centre. Furthermore, we allow $\mu_z$
to vary if the photo-z performance of a specific model can
be enhanced. If we observe that a specific template provides
very bad redshifts, a problem which cannot be resolved by adjusting
the respective prior, we omit this model completely in the following
runs. This is mostly the case for models created from the
highest two redshift bins. Since we chose the models only on
account of their photo-z performance in a redshift bin, they
might still yield a bad estimate in redshift ranges outside
the bin. The resulting model set then consists of nine SEDs
with adapted redshift priors. The luminosity priors were
set initially to $\mu_M = -20.0$, $\sigma_M = 6.0$, and $p_M = 6.0$, to
allow for a wide range of higher luminosities. The high exponent
leads to a very flat functional behaviour within $\sigma_M$, and to
steep decreases in $P_M(M)$ at $\mu_M \pm \sigma_M$. Adjusting $P_M(M)$
in the iteration is not necessary, since we cannot detect outliers
which could be avoided through a different luminosity
prior.
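The qualitative behaviour of such a prior, flat within $\sigma_M$ and steeply falling at $\mu_M \pm \sigma_M$, can be illustrated numerically. The exact functional form used by the PhotoZ code is not restated in this section, so the Python sketch below assumes an unnormalised generalised Gaussian, $\exp[-(|M - \mu_M|/\sigma_M)^{p_M}]$; the negative sign of the central magnitude is likewise an assumption. The redshift grid reproduces the resolution of 0.002 over $z \in [0.0, 3.0]$ quoted above.

```python
import math


def prior(x, mu, sigma, p):
    """Unnormalised generalised Gaussian (assumed form of the prior)."""
    return math.exp(-(abs(x - mu) / sigma) ** p)


# Initial luminosity prior parameters (sign of mu_M is an assumption).
mu_M, sigma_M, p_M = -20.0, 6.0, 6.0
inside = prior(mu_M - 0.5 * sigma_M, mu_M, sigma_M, p_M)   # well inside
edge = prior(mu_M - 1.0 * sigma_M, mu_M, sigma_M, p_M)     # at mu - sigma
outside = prior(mu_M - 1.5 * sigma_M, mu_M, sigma_M, p_M)  # beyond the edge

# Redshift grid of a photo-z run: step 0.002 over [0.0, 3.0].
z_grid = [round(0.002 * i, 3) for i in range(1501)]
```

Under this assumed form the prior is still above 0.98 half a $\sigma_M$ from the centre, drops to $1/e$ at the edge, and is below $10^{-4}$ at $1.5\,\sigma_M$.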

The model SEDs are shown in Fig. 19, where we plot them
in the wavelength range covered by SDSS at the redshifts of
our catalogue, $0.45 \leq z \leq 0.9$. In the lower panel we present
the redshift prior parameters $\mu_z$ and $\sigma_z$ which correspond
to the model SEDs. Fig. 20 shows the colours of the nine
models as a function of redshift. With the nine models we
Figure 16. Deviations in $g - r$, $r - i$, $i - z$ and $g - z$ colours for BC03 models and BC03 $\lambda^{-\beta}$ models from the data within $0.78 \leq z \leq 0.82$.
A detailed description of the plot is given in Fig. 13.


can account for the large spread of the data in most cases. 
Furthermore, we cover also the bluer parts in g — r, r — i, 
and i — z (cf. Fig.[^. For lower redshifts, our models produce 
colours that cover only the bluer $r - i$ and $i - z$ ranges of
the data. This is not because the BC03 $\lambda^{-\beta}$ models do not
fit the colours of the data (cf. Figs. 9 to 12), but accidental, since
the models were selected (from within their cluster bins)
on account of their photo-z performance in Sec. 4.2. The
model SEDs with designations 7 to 9 cover only peculiar
blue colours of the data. These models were created on the
basis of the higher z samples, which is why they match the
data better at higher redshifts (especially the $i - z$ colour).
The bluer colours of the models are observed only for
a small number of galaxies. Therefore, when all models are
fitted at the spectroscopic redshift (but also in the photo-z
estimation below), the models 7 to 9 are best fitting only for
~ 1 to 5 % of galaxies.

The physical parameters and the $\lambda_i$ and $\beta$ values of the nine
model SEDs are summarised in Tab. 1. They explain the behaviour
of the SEDs. From Fig. 19 we see that the model
SEDs roughly follow a trend and become bluer with increasing
redshift (except for the red coloured SED with $\mu_z = 0.8$),
which is mirrored in the values of Tab. 1. The e-folding time
scale $\tau$ roughly increases, as well as the burst fraction, making
the resulting SEDs bluer on average. The red highlighted
SED has lower fluxes than the orange and the yellow-green
SEDs in the UV part of the spectrum, not continuing the
sequence. It is redder because of the high extinction values
and because of its (relatively) high metallicity, $Z = Z_\odot$.

In summary, although we can see a qualitative trend in 
the SEDs as a function of redshift prior (which originates 
from the spectroscopic redshifts of the underlying z bin with 
small adaptions), the trends in the physical parameters are 
not that evident. This is because they are degenerate and 
changes in one parameter can yield similar results in the 
SEDs as a variation in another parameter (e.g., the well- 
known age-metallicity degeneracy). 


Before analysing the photo-z performance of the new
models, we want to introduce several metrics which we use
to assess the photometric redshift quality. We decide to
provide a large number of metrics to enable the reader to
compare with other publications. The photometric redshift
error is $\Delta z = z_\mathrm{phot} - z_\mathrm{spec}$, and
$\Delta z_\mathrm{rf} = \Delta z/(1 + z_\mathrm{spec})$ in
the rest frame. Catastrophic outliers are defined such that
$|\Delta z_\mathrm{rf}| \geq 0.15$ (cf. Ilbert et al. 2006). The mean errors
are characterized by the bias $\langle\Delta z\rangle$ and the mean absolute
error $\langle|\Delta z|\rangle$, as well as by their corresponding values in the
rest frame, $\langle\Delta z_\mathrm{rf}\rangle$ and $\langle|\Delta z_\mathrm{rf}|\rangle$. The root of the
sample variance is denoted $\sigma(\Delta z_\mathrm{rf})$, and $\sigma_{68,\mathrm{rf}}$ is
half of the width of the $\Delta z_\mathrm{rf}$ distribution where 68.27% of the
sample is located.
Figure 17. SOM with 30 x 30 bins for $g - r$, $r - i$, $i - z$, and $M_r$ for galaxies within $0.48 \leq z \leq 0.52$, where the parameter values are
colour-encoded (see colour bars on the right of each panel). $M_r$ is computed by fitting templates from G13 to the data at the spectroscopic redshift.
Contours show the number density of the data in the map (note that the contours are smoothed to improve the clarity of the figure) and
are drawn at six equally distributed frequency levels from ~ 15 % (black) to ~ 85 % (white).


Table 1. Physical parameters of the nine surviving templates. The first column gives the numbering of the models and the plot colour
according to Figs. 19 and 20. The column "burst" is the mass fraction of the burst, whereas $A_{V,\mathrm{burst}}$ is the burst extinction.

model #            $Z$     $\tau$ [Gyr]  age [Gyr]  $A_V$  burst  $A_{V,\mathrm{burst}}$  $\lambda_i$ [Å]  $\beta$
1 (black)          0.008   SSP           2.0        1.2    -      -                       5000             2.0
2 (violet)         0.004   SSP           6.0        0.6    1.0 %  2.0                     5500             1.5
3 (blue)           0.008   1.0           6.0        1.0    1.0 %  0.0                     5000             1.5
4 (turquoise)      0.05    SSP           8.0        0.7    1.0 %  0.0                     3500             2.0
5 (dark green)     0.02    SSP           4.0        0.0    1.0 %  1.0                     5000             1.5
6 (green)          0.05    SSP           3.0        1.3    2.0 %  0.0                     3500             2.0
7 (yellow-green)   0.05    3.0           5.0        1.4    1.0 %  0.0                     5000             2.0
8 (orange)         0.004   50.0          4.0        2.7    1.0 %  0.0                     3000             1.5
9 (red)            0.02    3.0           4.0        2.2    2.0 %  2.0                     4500             2.0

$\sigma_{68,\mathrm{rf}}$ corresponds to the $1\sigma$ confidence interval. Finally, in
terms of scatter we also calculate the normalised median absolute
deviation
$\sigma_\mathrm{NMAD} = 1.48 \cdot \mathrm{median}(|\Delta z_\mathrm{rf}|)$
(Ilbert et al. 2006), which is calculated for non-outliers only, and gives
a clue about the width of the $\Delta z_\mathrm{rf}$ distribution without
regarding the flanks. We calculate the fractions $\eta_{2\sigma}$, $\eta_{3\sigma}$
of galaxies within
$\langle\Delta z_\mathrm{rf}\rangle \pm i \cdot \sigma(\Delta z_\mathrm{rf})$,
where $i = 2, 3$ (cf. Carrasco Kind & Brunner 2014). If $\Delta z_\mathrm{rf}$
behaved as a perfect Gaussian, then $2 \cdot \sigma(\Delta z_\mathrm{rf})$ and
$3 \cdot \sigma(\Delta z_\mathrm{rf})$ would
be the $2\sigma$ and $3\sigma$ confidence intervals and therefore hold
95.45 % and 99.73 % of the objects. But the distribution of
$\Delta z_\mathrm{rf}$ can be non-Gaussian and still yield reasonable values
for $\eta_{2\sigma}$ and $\eta_{3\sigma}$ (compare the corresponding figures
later on).

To evaluate the precision of the photo-z errors $\delta z$ estimated
by the code from the PDF, we introduce
$\Delta z' = \Delta z/\delta z$. If the errors are estimated correctly, the
distribution of $\Delta z'$ should resemble a standard normal
distribution (cf. Sanchez et al. 2014). We will therefore
analyse the values of the mean $\langle\Delta z'\rangle$ and the standard
deviation $\sigma_{\Delta z'}$ of the distribution of $\Delta z'$.
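Several of the quality metrics introduced above can be written down compactly. The following Python sketch follows the definitions in the text for $\Delta z$, $\Delta z_\mathrm{rf}$, the outlier fraction, the bias, $\sigma(\Delta z_\mathrm{rf})$, $\sigma_\mathrm{NMAD}$ (non-outliers only), and the moments of $\Delta z'$. The percentile-based $\sigma_{68,\mathrm{rf}}$ and the fractions $\eta_{2\sigma}$, $\eta_{3\sigma}$ are omitted for brevity. The function name, dictionary keys, and input values are illustrative assumptions.

```python
from statistics import mean, median, stdev


def photoz_metrics(z_phot, z_spec, dz_est, outlier_limit=0.15):
    """Compute a subset of the photo-z quality metrics defined in the text."""
    dz = [zp - zs for zp, zs in zip(z_phot, z_spec)]
    dz_rf = [d / (1 + zs) for d, zs in zip(dz, z_spec)]
    good = [d for d in dz_rf if abs(d) < outlier_limit]  # non-outliers
    dz_prime = [d / e for d, e in zip(dz, dz_est)]       # Delta z / delta z
    return {
        "outlier_fraction": 1 - len(good) / len(dz_rf),
        "bias_rf": mean(dz_rf),
        "mean_abs_rf": mean(abs(d) for d in dz_rf),
        "sigma_rf": stdev(dz_rf),
        "sigma_nmad": 1.48 * median(abs(d) for d in good),
        "mean_dz_prime": mean(dz_prime),
        "sigma_dz_prime": stdev(dz_prime),
    }


# Hypothetical redshifts of four objects; the last one is an outlier.
z_spec = [0.5, 0.6, 0.7, 0.5]
z_phot = [0.53, 0.6, 0.67, 0.95]
dz_est = [0.03, 0.03, 0.03, 0.03]
metrics = photoz_metrics(z_phot, z_spec, dz_est)
```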

Lastly, we perform Kolmogorov-Smirnov (KS) tests on the
distributions $N(z_\mathrm{phot})$ and $P(z_\mathrm{phot})$ (cf. Carrasco Kind &
Brunner 2014), where the latter is the $z_\mathrm{phot}$ distribution
derived by stacking the PDFs for all objects. The KS
test value is the maximum absolute difference between
the cumulative distribution function $\int P(z)\,\mathrm{d}z$ of the
probability density of $N(z_\mathrm{phot})$, or of the stacked
$P(z_\mathrm{phot})$, and the cumulative distribution of $N(z_\mathrm{spec})$.
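For single-value photo-zs, the KS test value described above reduces to the maximum absolute difference between two empirical cumulative distributions. A minimal Python sketch of this two-sample statistic is given below; it does not cover the stacked-PDF case, where the cumulative integral of the summed $P(z)$ would take the place of one of the empirical distributions. The function name is an illustrative choice.

```python
from bisect import bisect_right


def ks_statistic(sample_a, sample_b):
    """Maximum absolute difference between two empirical CDFs."""
    a, b = sorted(sample_a), sorted(sample_b)
    # The empirical CDFs only change at the data values themselves.
    return max(abs(bisect_right(a, x) / len(a) - bisect_right(b, x) / len(b))
               for x in a + b)


identical = ks_statistic([0.5, 0.6, 0.7], [0.5, 0.6, 0.7])
disjoint = ks_statistic([0.1, 0.2], [0.8, 0.9])
```

Identical samples give a value of 0, while two completely separated samples give the maximum value of 1.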

After this introduction of the photo-z quality metrics
we present the photometric redshift results of the CMASS
sample with the novel template set and priors of Sec. 4.2
and Fig. 19 in Fig. 21. The upper left panel shows $z_\mathrm{spec}$
versus $z_\mathrm{phot}$, the middle left panel shows the photometric
redshift rest frame error as a function of spec-z, and the
lower left panel presents $z_\mathrm{spec}$ versus $|\Delta z_\mathrm{rf}|$. We
indicate the
Figure 19. Upper panel: The nine surviving model SEDs within the range of the SDSS ugriz filter system at redshifts $0.45 \leq z \leq 0.9$
(the z range of our galaxy sample). Lower panel: Corresponding z prior parameters, $\mu_z$ and $\sigma_z$. The colour code is the same as in the
upper panel.


median $\Delta z_\mathrm{rf}$ and $|\Delta z_\mathrm{rf}|$ values in the middle left panel by
solid and dashed black lines, whereas in the lower panel we
highlight the $\sigma_{68,\mathrm{rf}}$ and $\sigma_\mathrm{NMAD}$ values as a function of the
spectroscopic redshift also by solid and dashed black lines.
Finally, the right panel presents the normalised redshift
distributions for $z_\mathrm{spec}$ (grey filled histogram), the distribution
derived from the single-value photometric redshifts $z_\mathrm{mode}$
(grey line) derived from the mode of $P(z)$, and the stacked
PDFs (black line). The photometric redshift quality metrics
discussed above are printed in the plot as well.

The bias has a small positive value for the lowest considered
redshifts, then is close to zero, and decreases to negative
values for higher redshifts $z \sim 0.7$. The overall mean value is
still positive due to the small sample size at higher redshifts,
visible by the number density in the right panel of
Fig. 21. The scatter (lower left panel in Fig. 21) increases
slightly with increasing spec-z, while the value of $\sigma_\mathrm{NMAD}$
stays more or less the same. This means that the outliers
(which are not considered in the calculation of $\sigma_\mathrm{NMAD}$)
are predominantly responsible for an increase in
$\sigma(\Delta z_\mathrm{rf})$.


The fraction of catastrophic outliers, however, is very small,
$\eta = 0.22$ %. In the right panel of Fig. 21 we observe that the
photo-z predictions from the mode of the PDF yield deviations
from $N(z_\mathrm{spec})$ larger than for the case where the whole
PDFs are considered. The excesses observed in $N(z_\mathrm{phot})$ are
due to the overestimation mentioned previously. They are
no longer visible when we use the posterior $P(z)$ distributions
in the $N(z)$ reconstruction, which means that the
PDF should be favoured for science analyses (cf., e.g.,
Mandelbaum et al. 2008; Hildebrandt et al. 2012).


As for the error estimation of the PhotoZ code, we see from
the values of $\langle\Delta z'\rangle$ and $\sigma_{\Delta z'}$ that $\Delta z'$ behaves very
closely to a standard normal distribution, which is the goal in the
photometric redshift error predictions. This means that not
only the approach for the calculation of $\delta z$ is legitimate, but
also that the models and priors create a reasonable $P(z)$
(from which $\delta z$ is extracted).

The table in Sec. 5.2 holds a summary of the derived quality
metrics, together with results of public photo-zs from the
SDSS, which we will analyse in Sec. 5.2.
Figure 20. In the upper panels of the four major panels the CMASS colours are plotted as a function of redshift in grey, and the
median is shown by a black dashed line. The lower panels are normalised to the median colour and show the residuals. Error bars present
the median data errors in five redshift bins. On top of that the predicted colours of the nine selected models are drawn. The colour code
matches that of Fig. 19.


To investigate whether the estimated errors are reliable in
identifying outliers, we assume the null hypothesis that an object is
a photo-z outlier. The probability of an outlier being falsely
classified as a non-outlier on account of $\delta z$ is then the type
I error $\alpha$. Additionally, the type II error $\beta$ gives the
probability of a non-outlier being misclassified as an outlier by
the estimated error. For the photo-zs of this work we get
a = 0.93 and /3 = 3.5 • 10“^. This means that an outlier is 
falsely classified as a non-outlier with probability $\alpha$, but also
that a non-outlier is almost never classified as an outlier.
Although $\sigma_{\Delta z'}$ is close to, but slightly smaller than, 1, the
deviation is probably caused by too shallow peaks around the
mode of $P(z)$ for outliers, such that their errors are
underestimated. The $\alpha$ and $\beta$ values are summarised together with
the results of the SDSS in the table of Sec. 5.2.

The distribution of the photometric redshift errors in
the rest frame is shown in Fig. 22 by a grey filled histogram.
In red we indicate the bias and the ranges of
$\langle\Delta z_\mathrm{rf}\rangle \pm i \cdot \sigma(\Delta z_\mathrm{rf})$, where
$i = 1, 2, 3$. Furthermore, we calculate
the real interval widths where 95.45% and 99.73%
of the galaxies are located and introduce them in the plot
(black dash-dotted and dotted vertical lines). Finally, we
calculate the $\Delta z_\mathrm{rf}$ distribution from the stacked PDFs and
fit a Gaussian to the histogram (green lines).

The peak of the $\Delta z_\mathrm{rf}$ distribution is slightly shifted to the
right, which is again due to the overestimated photo-zs at
the lowest redshifts. Concerning the resemblance to a Gaussian,
we observe from the values given in the plot that
$\sigma_{95,\mathrm{rf}} \approx 2\sigma(\Delta z_\mathrm{rf})$ (which is why we cannot
discern the lines in Fig. 22) and
$\sigma_{99,\mathrm{rf}} > 3\sigma(\Delta z_\mathrm{rf})$. The latter is due
to the outliers, which can be scattered far from the spec-z
value due to the allowed fitting range. The fractions $\eta_{2\sigma}$ and
$\eta_{3\sigma}$ of objects within $2\sigma(\Delta z_\mathrm{rf})$ and
$3\sigma(\Delta z_\mathrm{rf})$ have reasonable
values. While $\eta_{2\sigma} = 95.23$% is very close to the desired
value of 95.45%, $\eta_{3\sigma} = 98.98$% is smaller than 99.73% by
only ~0.8%. The $\Delta z_\mathrm{rf}$ distribution derived from the PDFs
is slightly broader than the distribution of the individual
photo-z results. This is a consequence of the asymmetry of
the PDFs, which often have higher probabilities for redshifts
greater than the most probable z.

Fig. 23 presents the deviations between the magnitudes predicted by
the models in a photo-z run and the data. A Gaussian is
fitted to the histograms, whose best fitting parameters are
printed in the panels. For the r and i band magnitudes the
$\Delta\mathrm{mag}$ distributions are very narrow, while the expectation
values of the Gaussians are near zero. This is thanks to the
Figure 21. Photo-z results with novel templates, based on the BC03 $\lambda^{-\beta}$ models and selected through a SOM and a k-means clustering
algorithm. Left Panels: The upper panel shows the spectroscopic versus the photometric redshift, where the dashed line is at $z_\mathrm{phot} = z_\mathrm{spec}$.
The middle panel is the distribution of the rest frame photo-z error as a function of the spectroscopic redshift, where we plot the bias
(solid line) as well as the mean absolute error (dashed line). Finally, the lower panel shows $z_\mathrm{spec}$ versus the absolute rest frame error,
where we plot $\sigma_{68}$ (solid) and $\sigma_\mathrm{NMAD}$ (dashed) on top. Right Panel: The relative frequency distributions of $z_\mathrm{spec}$ (grey filled histogram),
$z_\mathrm{phot}$ (dark grey line), and the stacked PDF of all objects (black line). Additionally, the photo-z quality metrics are printed in the figure.
We provide two values for the KS test, for the predictions of single $z_\mathrm{phot}$ values (from the mode of the PDF, grey), and for the PDF
(black).


depth of the photometry in these filters, which is greatest
in i and r, where the photometry has the smallest measurement
uncertainties. For the g and the z band magnitudes,
however, the width of the $\Delta\mathrm{mag}$ distributions increases,
which is mostly an effect of the shallower photometry in
these bands, but also of the respective magnitude ranges of
the galaxies. Furthermore, the mean magnitude deviations
in these two bands are higher than in r and i. While $\mu$ of the
best fitting Gaussian is still small in comparison to $\sigma$ for the
g band, it is relatively high in the z band. We selected the
models for the photo-z estimation only on account of their
photo-z performance, and not on how well they match the
data in terms of magnitudes. However, this does not
necessarily imply that the BC03 models are not able to
fit the data in terms of magnitudes. Indeed, calculating the
average magnitude offsets of the SED fitting of Sec. Eni 
yield maximum values of [0.6, 0.9, 0.3,1.1] -10 ^ in g, r, z, 
and z for the four redshift bins. 

In the next section we want to compare the photometric 
redshift results of this section with the photo-zs provided 
by SDSS. 


5.2 Comparison to SDSS Photometric Redshifts 


We present the photometric redshifts from the SDSS-III
database in comparison to the results of our code-template
combination. The SDSS database provides photo-zs from
two different empirical methods. One code uses a k-d tree
nearest neighbour fit to derive the redshifts (hereafter KF
after Csabai et al. 2007). The SDSS-KF results for our
catalogue are shown in Fig. which is equivalent to
Fig. 21 and also shows the quality metrics. From the middle
panel of Fig. we see that the densest part of the
population lies close to the $\Delta z = 0$ line for low
redshifts $z < 0.6$. In all higher $z_{\rm spec}$ regions the photometric
redshift is systematically underestimated, which leads to the
negative value of the bias. An underestimation is also
present in the results from Sec. 5.1, but it is slightly stronger
in the SDSS-KF case. From the lower panel of Fig.
we see that here, too, the rise in $\sigma_{68,{\rm rf}}$ is mostly caused
by outliers, which are not considered in the calculation of
$\sigma_{\rm NMAD}$ (which stays more or less constant). Taking the
estimated photometric redshift errors into account, the
relatively high bias does not decrease but even increases,
which is mirrored in the higher absolute value of
$\langle \Delta z' \rangle$.
Hence, the bad estimates of the photo-zs are not recognized

Photo-z results from the KF approach published by the SDSS. See Fig. 21 for a detailed description.

Figure 26. Photo-z results from the RF approach published by the SDSS. See Fig. for a detailed description.

Figure 18. k-means clustering of the SOM for $0.48 \le z \le 0.52$
from Fig. 17. The Voronoi bins are highlighted by different
colours, and cluster centres are marked by black crosses. Contours
show the number frequency of the data (note that the contours
are smoothed to increase the clarity of the figure) and are drawn
at 10% intervals between 10% (black) and 90% (white).


by the error estimation. This can also be seen in the value
of $\sigma_{\Delta z'}$, which is well above 1 and means that the errors
are on average underestimated. This also affects the results
of a significance test on outlier classification (cf. Sec.).
The type I and type II errors read (to two significant figures)
$\alpha = 1.0$ and $\beta = 7.9 \cdot 10^{-4}$ (see also Tab. 3 for comparison
with the other results), confirming that the errors are
underestimated.

In terms of $N(z_{\rm phot})$ the low photo-z values at higher $z_{\rm spec}$
do not change the shape significantly, which is due to the
low sample size in these regions and results in the low
value of KS. We cannot compare to the $N(z)$ results for
PDFs, since they are not provided.
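
The KS value quoted for the redshift distributions can be illustrated with a minimal two-sample Kolmogorov-Smirnov statistic. The sketch below assumes that KS denotes the maximum distance between the empirical cumulative distributions of the spectroscopic and photometric redshifts; the function name `ks_statistic` and the toy redshift values are hypothetical and are not taken from the analysis of this work.

```python
import bisect


def ks_statistic(sample_a, sample_b):
    """Two-sample KS statistic: largest gap between the empirical CDFs."""
    a = sorted(sample_a)
    b = sorted(sample_b)
    if not a or not b:
        raise ValueError("both samples must be non-empty")
    largest_gap = 0.0
    # Both CDFs are step functions, so the supremum is reached at a data point.
    for x in a + b:
        cdf_a = bisect.bisect_right(a, x) / len(a)
        cdf_b = bisect.bisect_right(b, x) / len(b)
        largest_gap = max(largest_gap, abs(cdf_a - cdf_b))
    return largest_gap


# Toy example (made-up values): photo-zs piling up at z = 0.6.
z_spec = [0.50, 0.60, 0.70, 0.80]
z_phot = [0.50, 0.60, 0.60, 0.60]
print(ks_statistic(z_spec, z_phot))
```

Under this assumption a smaller value indicates that the photometric redshift distribution follows the spectroscopic one more closely.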

As before, we show the $\Delta z_{\rm rf}$ distribution in Fig. 25.
Comparing with our case from Fig. 22, we see that here the
results are slightly shifted to the left, an effect due to the
underestimated photo-zs at higher redshifts. Also in the
case of the SDSS-KF redshifts, the $\sigma_{95,{\rm rf}}$ and $\sigma_{99,{\rm rf}}$ values
do not perfectly coincide with $2\sigma(\Delta z_{\rm rf})$ and $3\sigma(\Delta z_{\rm rf})$, but
are elevated. $\eta_{2\sigma} = 95.25\%$ is very close to the desired
value, while $\eta_{3\sigma} = 98.43\%$ deviates by ~1.3%.

In comparison to the redshifts we get with our novel 
templates and code, the quality metrics for bias, error, and 
scatter are all higher in the case of the SDSS-KF photo-zs. 
Furthermore, the estimated redshift errors are more reliable 
in our code-template combination. The KS test yields 
better results in the case of this work, since the SDSS-KF 
photo-zs are systematically underestimated. 
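
To make the compared quantities concrete, the following sketch computes a set of photo-z quality metrics for a toy sample. The definitions are assumptions, since they are not restated in this section: the rest frame error is taken as $(z_{\rm phot} - z_{\rm spec})/(1 + z_{\rm spec})$, outliers are objects with an absolute rest frame error above 0.15, the NMAD uses the conventional factor 1.4826, and the scaled error is $(z_{\rm phot} - z_{\rm spec})/\delta z$. The function name `quality_metrics` and all input values are hypothetical.

```python
import statistics


def quality_metrics(z_phot, z_spec, z_err, outlier_cut=0.15):
    """Summary statistics of photo-z errors (all definitions are assumptions)."""
    dz = [zp - zs for zp, zs in zip(z_phot, z_spec)]
    # Rest frame error and error scaled by the estimated uncertainty.
    dz_rf = [d / (1.0 + zs) for d, zs in zip(dz, z_spec)]
    dz_scaled = [d / e for d, e in zip(dz, z_err)]
    median_rf = statistics.median(dz_rf)
    abs_dev = [abs(d - median_rf) for d in dz_rf]
    return {
        "outlier_fraction": sum(abs(d) > outlier_cut for d in dz_rf) / len(dz_rf),
        "bias": statistics.fmean(dz_rf),
        "mean_abs_error": statistics.fmean([abs(d) for d in dz_rf]),
        "sigma": statistics.pstdev(dz_rf),
        "sigma_nmad": 1.4826 * statistics.median(abs_dev),
        "scaled_bias": statistics.fmean(dz_scaled),
        "scaled_sigma": statistics.pstdev(dz_scaled),
    }


# Toy sample (made-up values) with one catastrophic failure.
z_spec = [0.5, 0.5, 0.5, 0.5]
z_phot = [0.50, 0.53, 0.47, 0.80]
z_err = [0.03, 0.03, 0.03, 0.03]
metrics = quality_metrics(z_phot, z_spec, z_err)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

In this toy sample the scatter of the scaled errors is well above 1, which corresponds to the case of underestimated redshift errors, while the NMAD is barely affected by the single outlier.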


The second set of photometric redshifts published by
the SDSS comes from another empirical code, which uses
random forests to predict photo-zs (Carliles et al. 2010,
hereafter SDSS-RF). Fig. 26 presents the results of the
SDSS-RF code. In this case, the outlier rate is significantly


Figure 22. Photometric redshift rest frame error distribution
(grey filled histogram) estimated with the novel templates and
priors. $\Delta z = 0$ is highlighted by a solid black line, whereas the
bias is shown by a solid red line. The steps at $\Delta z_{\rm rf} \pm \sigma(\Delta z_{\rm rf})$,
$2\sigma(\Delta z_{\rm rf})$, and $3\sigma(\Delta z_{\rm rf})$ are represented by dashed, dash dotted,
and dotted red lines, and the corresponding number fractions are
given in the legend. We also plot a Gaussian with $\mu = \langle \Delta z_{\rm rf} \rangle$
and $\sigma = \sigma(\Delta z_{\rm rf})$ in grey. Furthermore, dash dotted and dotted
grey vertical lines show the real $\sigma_{95,{\rm rf}}$ and $\sigma_{99,{\rm rf}}$ intervals. Since
$\sigma_{95,{\rm rf}} \approx 2\sigma(\Delta z_{\rm rf})$ we cannot discern the red and the grey dash
dotted lines. The $\Delta z_{\rm rf}$ derived from the stacked PDFs is shown
by a green solid histogram, to which we fit a Gaussian highlighted
by a dashed green curve.

Figure 23. Deviations of the magnitudes predicted by the novel
model SEDs from the data.

Figure 25. Photometric redshift rest frame error distribution
(grey filled histogram) estimated by the SDSS-KF method. A
detailed explanation is given in Fig. 22.

Figure 27. Photometric redshift rest frame error distribution of
the SDSS-RF published by the SDSS. A detailed explanation is
given in Fig.


higher than in both previous cases, as well as the mean and
the absolute rest frame errors. Above $z \sim 0.55$ the photo-zs
are systematically underestimated to a greater extent than
in the former two cases, yielding the strongly negative bias.
Furthermore, the scatter values $\sigma(\Delta z_{\rm rf})$, $\sigma_{68,{\rm rf}}$, and
$\sigma_{\rm NMAD}$ have also increased in comparison to the results of
this work, as well as compared to the SDSS-KF redshifts.

In the distribution of $N(z_{\rm phot})$ we can observe the lack of


Table 3. Significance test for outlier classification. The null
hypothesis is that an object is an outlier, and $\alpha$ and $\beta$ are the
type I and II errors respectively (rounded to two significant
figures).

           this work             SDSS-KF               SDSS-RF
$\alpha$   0.93                  1.0                   0.86
$\beta$    $3.5 \cdot 10^{-4}$   $7.9 \cdot 10^{-4}$   $1.4 \cdot 10^{-2}$


higher photo-z values, which accumulate around $z \sim 0.55$.
Because of that, the KS test value is higher. In terms of
their estimated photometric redshift errors, the SDSS-RF
method outperforms the SDSS-KF approach, but still
produces a larger bias in $\langle \Delta z' \rangle$. The errors are furthermore
overestimated, visible in the $\sigma_{\Delta z'}$ value, which is lower than
the desired value of 1. Again, this also affects the outlier
classification derived from $\delta z$. In the case of SDSS-RF
photo-zs, the null hypothesis of an object being an outlier
yields $\alpha = 0.86$ and $\beta = 0.015$ (shown together with the
previous results in Tab. 3). The higher values of $\delta z$ yield
a smaller probability of an object being misclassified as
a non-outlier, but also raise $\beta$ (although only to a small
value). So although the errors are overestimated on average,
outliers are still misclassified at a significance level of 0.86.
In Fig. 27 we plot the $\Delta z_{\rm rf}$ distribution. The shift to
the left is present to an even greater extent than in the
previous case, and the distribution strongly deviates from the
Gaussian with $\mu = \langle \Delta z_{\rm rf} \rangle$ and $\sigma = \sigma(\Delta z_{\rm rf})$ that is also
plotted in Fig. 27. This is a result of the underestimation of
the photo-zs. Furthermore, as in the previous cases, the value
of $\sigma_{95,{\rm rf}}$ is marginally higher than $2\sigma(\Delta z_{\rm rf})$, while the
deviation is greater in $\sigma_{99,{\rm rf}}$. On the other hand, $\eta_{2\sigma} = 95.35\%$
is even closer to 95.45% than in both previous cases, while
$\eta_{3\sigma} = 97.93\%$ deviates more strongly, by ~1.8%.
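
The reference values for the number fractions follow from the Gaussian error function, and the quoted deviations can be checked with a few lines of arithmetic. The sketch below is only an illustration: the helper names `gaussian_fraction` and `fraction_within` are hypothetical, and only the percentages quoted in the text are used as input.

```python
import math


def gaussian_fraction(k):
    """Fraction of a Gaussian distribution within k standard deviations."""
    return math.erf(k / math.sqrt(2.0))


def fraction_within(values, k):
    """Observed fraction of values within k standard deviations of their mean."""
    mean = sum(values) / len(values)
    sigma = math.sqrt(sum((v - mean) ** 2 for v in values) / len(values))
    return sum(abs(v - mean) <= k * sigma for v in values) / len(values)


expected_2sigma = 100.0 * gaussian_fraction(2)  # about 95.45 per cent
expected_3sigma = 100.0 * gaussian_fraction(3)  # about 99.73 per cent

# Number fractions within 3 sigma quoted in the text (per cent).
eta_3sigma = {"SDSS-KF": 98.43, "SDSS-RF": 97.93}
for name, eta in eta_3sigma.items():
    print(name, round(expected_3sigma - eta, 1))
```

The differences of about 1.3 and 1.8 percentage points reproduce the deviations stated for the SDSS-KF and SDSS-RF photo-zs.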


We present a summary of the photometric redshift quality
metrics of the results from the template fitting of this
work and the two SDSS photo-zs in Tab. 2. The model-prior
combination of this work produces the lowest outlier
fraction, bias, and mean absolute error compared to the SDSS
photo-z results. When considered as a function of spec-z, the
photometric redshifts derived in this work are slightly biased
to lower $z_{\rm phot}$ values for higher $z_{\rm spec}$, but not to the same
extent as those of the SDSS codes. Furthermore, the different
scatter values calculated (i.e., $\sigma(\Delta z_{\rm rf})$, $\sigma_{68,{\rm rf}}$, and $\sigma_{\rm NMAD}$)
are also lower for the photo-zs of this work. The distribution
of redshift errors is more similar to a Gaussian for the models,
priors, and code of this work, which can be observed in
Figs. 22, 25, and 27. Considering the number fractions within
$2\sigma(\Delta z_{\rm rf})$ and $3\sigma(\Delta z_{\rm rf})$, all three codes perform similarly
well. However, this does not mean that the $\Delta z_{\rm rf}$ distributions
are necessarily good approximations of Gaussian distributions
with $\mu = \langle \Delta z_{\rm rf} \rangle$ and $\sigma = \sigma(\Delta z_{\rm rf})$, which can be observed in
Figs. and

When we evaluate the similarity between the redshift
distributions, from which spectroscopic and photometric redshifts
are sampled, through a KS test, the photo-zs of this work
yield the best results for predictions of the mode, which are
improved further when considering the whole $P(z)$ distribution.
The results of a significance test with the null hypothesis
being that an object is a photometric redshift outlier are shown
in Tab. 3. None of the codes produces errors that can
reliably predict outliers. Although the SDSS-RF produces the

Table 2. Summary of photometric redshift quality metrics of the template fitting results with the novel templates and priors used with
the PhotoZ code, the SDSS photo-zs of the KF code, and the random forest code (SDSS-RF). Values for KS in the case of this work are
derived from $P(z)$, whereas the values in brackets are calculated using only the $z_{\rm mode}$ predictions.

setup      $\eta$ [%]  $\langle \Delta z_{\rm rf} \rangle$  $\langle |\Delta z_{\rm rf}| \rangle$  $\sigma(\Delta z_{\rm rf})$  $\sigma_{\rm NMAD}$  $\sigma_{68,{\rm rf}}$  $\langle \Delta z' \rangle$  $\sigma_{\Delta z'}$  KS
this work  0.22        0.0020   0.023   0.031   0.026   0.026   0.050   0.98   0.067 (0.074)
SDSS-KF    0.91        -0.011   0.027   0.038   0.027   0.029   -0.54   1.5    0.082
SDSS-RF    3.0         -0.023   0.035   0.051   0.032   0.036   -0.27   0.61   0.14

best $\alpha$, this is mostly due to the overestimated errors (cf.
Tab. 2).
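
The type I and II errors of the significance test can be written down as simple counting ratios. The sketch below assumes that every object already carries two boolean labels: whether it truly is a photo-z outlier and whether it is flagged as one on the basis of its estimated error. How the flag is derived from $\delta z$ is not restated here, so it enters only as an input; the function name `error_rates` and the toy labels are hypothetical.

```python
def error_rates(is_outlier, flagged_as_outlier):
    """Type I and II error rates for the null hypothesis 'object is an outlier'.

    alpha: fraction of true outliers that are not flagged (null wrongly rejected).
    beta:  fraction of non-outliers that are flagged (null wrongly kept).
    """
    outlier_flags = [f for truth, f in zip(is_outlier, flagged_as_outlier) if truth]
    regular_flags = [f for truth, f in zip(is_outlier, flagged_as_outlier) if not truth]
    if not outlier_flags or not regular_flags:
        raise ValueError("need at least one outlier and one non-outlier")
    alpha = sum(not f for f in outlier_flags) / len(outlier_flags)
    beta = sum(bool(f) for f in regular_flags) / len(regular_flags)
    return alpha, beta


# Toy labels (made-up): two true outliers, one of which is missed.
truth = [True, True, False, False, False, False]
flagged = [True, False, False, False, False, True]
alpha, beta = error_rates(truth, flagged)
print(alpha, beta)
```

With these definitions, an $\alpha$ close to 1 means that almost no true outlier is recognized from its estimated error, whereas a small $\beta$ means that non-outliers are rarely flagged.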

5.3 Deviations in Colour Predictions 

In the last section we analysed the photometric redshift
results of our code in comparison to the SDSS photo-zs. The
redshifts from the SDSS-RF method were outperformed by
those derived with the SDSS-KF code. We now analyse
whether the photo-zs of this work reproduce the CMASS galaxy
colours better than the same models fitted to the data at the
photometric redshifts of the SDSS-KF method.

Fig. 28 presents the differences between the model
predictions and the data in the $g - r$, $r - i$, $i - z$, and $g - z$
colours. The predictions for the photo-zs of this work are
plotted in grey, the best fitting Gaussians are presented by
black curves, and their parameters are given in the plot.
When fitted at the photometric redshifts of the SDSS-KF
approach, the novel BC03 $\lambda^{-\beta}$ models from Sec. 4 predict
colours which are represented by blue histograms in Fig. 28,
to which we also fit Gaussian curves. In all four colours,
the standard deviation of the $\Delta$col distribution is larger for
SDSS-KF. Furthermore, the mean deviation from the data
colours is greater for the SDSS-KF results in the $i - z$ and $g - z$
colours. $\mu$ is of the same order in $r - i$ for both cases. It
is smaller for the SDSS-KF photo-zs in $g - r$, but there both
values are very low. The fact that the SDSS-KF $r - i$, $i - z$,
and $g - z$ colours are shifted bluewards from the distribution
in colours of this work is a result of the SDSS-KF photo-zs
being more heavily underestimated. All considered $\Delta$col
distributions resemble their best fitting Gaussians well in the
inner parts, with increased flanks. However, this increase is
more significant for the SDSS-KF photo-zs in $r - i$.
Therefore, the photometric redshifts derived in this paper
not only generally have better quality metrics than the SDSS
photo-zs, but can furthermore recreate the colours of the
CMASS galaxies to a better extent.


6 SUMMARY AND CONCLUSIONS 

In this work we created a set of model SEDs that are
designed to match the colours of the BOSS CMASS
sample and provide accurate photometric redshifts. We
first analysed the colours of the LRG model of Maraston
et al. (2009), which was created to match the $g - r$ and
$r - i$ colours of the LRG sample of SDSS-II (Eisenstein
et al. 2001), and found that we cannot use a single age
or evolution configuration that matches the data in all
colours. Therefore, we created models for four redshift bins
of widths $\Delta z = 0.04$ centred on $z_{\rm spec} = 0.5, 0.6, 0.7, 0.8$
with the stellar population synthesis code of Bruzual &
Charlot (2003, BC03). We generated SSPs and CSPs with
decreasing SFHs at various metallicities, and sampled the
models from a wide age range. These models were then
fitted in superposition with a burst component and dust
extinction to the data at known spectroscopic redshifts.

We observe a mismatch in the colours of the data and the
models which is due to a too shallow red continuum slope in
the model SEDs. Large variations in the model parameters
(i.e., metallicity, SFH, age, burst, dust extinction, IMF) do
not produce models that better fit the data. We speculate
that inaccuracies in the modelling of the AGB phase could
be the reason for the observed colour mismatch. In order
to better recreate the colours of the CMASS galaxies we
introduced additional degrees of freedom by modifying the
model SEDs by multiplication with $\lambda^{-\beta}$ for $\lambda > \lambda_i$ with
several values for $\lambda_i$ and $\beta$. We showed that the BC03
models modified in this way indeed are a better match
to the colours of the CMASS galaxies (Figs. to 12 and
13 to 16) and also yield better $\chi^2$ values from the
fitting (Fig.).
From these best fitting SEDs we selected a small subset
that should cover the region in colour space and absolute
magnitude in r of the CMASS sample. We therefore
projected the CMASS galaxy colours and $M_r$ of the four
z bins onto two-dimensional planes using a self-organizing
map. Afterwards we partitioned the plane into six clusters for
each redshift bin with a k-means clustering algorithm and
selected one model SED per cluster cell that produces the best
photo-zs for galaxies within the same cell. We estimated
photometric redshifts with a template fitting code and
the selected models and analysed their individual
performance. In doing so we modified their redshift priors to
improve the photo-zs, but also decided to omit some
of the templates which do not yield accurate photo-zs on
the whole sample, regardless of how the priors are modified.
We then compared the photometric redshift results with
the photo-zs of two empirical methods published by the SDSS
and calculated several metrics that assess the quality of
the photo-zs, their estimated errors, and their distribution.
We found that the photo-zs with the generated models
of this work produce better values in all quality metrics.
Furthermore, we observed that including the stacked PDFs
yields better results in the reconstruction of $N(z_{\rm spec})$,
mirrored in the results of a KS test. Concerning the estimated
errors $\delta z$, a significance test shows that none of the three
considered $\delta z$ results provides a reliable classification of
outliers. However, the probability of a non-outlier being
misclassified is very small for all three considered cases.
Finally, we compared the predicted colours of the novel
model SEDs when fitted to the photo-zs of this work and to
the better of the two SDSS redshifts, to the data. We found
that the deviations from the data are smaller for photo-zs

Figure 28. Deviation in $g - r$, $r - i$, $i - z$, and $g - z$ colours for photometric redshift results with the novel templates (grey filled
histogram) and model predictions when fitting the same templates to the SDSS-KF photo-zs (blue histogram). Gaussians are fitted to
both distributions, and are shown by a solid black line and a dashed green line respectively. The best fitting parameters of the Gaussians
are printed in the panels.


of this work. 

The models of this work and G13 can be downloaded
together with the priors at http://www.cefca.es/people/~ngreisel/.
We would be happy to provide the photometric
redshifts on request.


APPENDIX A: SED FITTING RESULTS


In this paper we create models that should match the BOSS
CMASS galaxies in terms of their colours in redshift bins
with widths 0.04 centred on z = 0.5, 0.6, 0.7, 0.8. We do so
by fitting sets of model SEDs with different properties like
metallicity, SFHs, and ages to the data (Sec. 4.1.3). In
superposition to the SEDs, a burst model and additional dust
extinctions for the burst and the main component are
further degrees of freedom. In Fig. A1 we show the distributions
of the $\chi^2$ values returned by the SED fitting code, when we
fit the BC03, M05, and M11 models in equal setups to the
data. The setup of the SED fitting and the code are detailed
in Sec. 4.1 and are equal for all cases.

In Sec. 4.1.3 we modify the BC03 models, changing their
continuum slope by multiplying with $a \cdot \lambda^{-\beta}$ for $\lambda \ge \lambda_i$

Figure A1. $\chi^2$ distributions of SED fitting runs (at $z_{\rm spec}$) detailed with BC03 (grey), BC03 $\lambda^{-\beta}$ (orange), M05 (blue), and M11 (green)
models for the four considered redshift regions. The frequency is normalised and plotted logarithmically. We can see that although the
$\chi^2$ distributions for BC03, M05, and M11 are similar, the $\chi^2$ values are improved by including the BC03 $\lambda^{-\beta}$ models.

with various $\beta$ and $\lambda_i$ values. This is done to introduce
further degrees of freedom and to create models that match
the colours of the data to a better extent. We fit the $\lambda^{-\beta}$
modifications of BC03 SSPs and CSPs with decreasing SFRs
to the CMASS galaxy colours with the same setup as
mentioned above and as detailed in Sec. 4.1.3. The resulting $\chi^2$
values of the fit are shown together with the previous results
of the BC03, M05, and M11 models in Fig. A1, where they
are highlighted by green histograms.
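
The continuum modification can be sketched in a few lines. This is a minimal illustration under a stated assumption: the normalisation $a$ is chosen as $\lambda_i^{\beta}$, so that the modified SED stays continuous at $\lambda_i$; the value of $a$ actually used is not restated here. The function name `modify_sed`, the wavelength grid, and the parameter values are hypothetical.

```python
def modify_sed(wavelengths, fluxes, lambda_i, beta):
    """Multiply an SED by (lambda / lambda_i)**(-beta) for lambda >= lambda_i.

    Assumption: the normalisation a = lambda_i**beta keeps the SED
    continuous at lambda_i; shorter wavelengths are left unchanged.
    """
    modified = []
    for wavelength, flux in zip(wavelengths, fluxes):
        if wavelength >= lambda_i:
            flux = flux * (wavelength / lambda_i) ** (-beta)
        modified.append(flux)
    return modified


# Toy SED (made-up values): flat spectrum, slope changed above 4000 Angstrom.
wavelengths = [3000.0, 4000.0, 8000.0]
fluxes = [1.0, 1.0, 1.0]
print(modify_sed(wavelengths, fluxes, lambda_i=4000.0, beta=1.0))
```

Each pair of $\lambda_i$ and $\beta$ yields one modified template, which is how the additional degrees of freedom enter the fit.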

While the original BC03, M05, and M11 models produce
similar results in terms of $\chi^2$, the fits are significantly
improved by using the BC03 $\lambda^{-\beta}$ models. This is
also shown in terms of colours in Figs. to of Sec.


APPENDIX B: QUERY FOR CMASS SAMPLE

We present here the SQL query executed to download the
data from the SDSS-III CasJobs website^1 used in this work
and detailed in Sec. In the following <fil> denotes any of
the five Sloan filters and can be replaced by u, g, r, i, and
z.

-- spectroscopic redshift, photometry, and the two SDSS photo-zs
select p.ObjID, s.z as zSpec,
  s.CModelMag_<fil>, s.CModelMagErr_<fil>,
  s.Extinction_<fil>,
  p.CModelFlux_<fil>, p.CModelFluxIvar_<fil>,
  pz.z as zPhot, pz.zErr as zPhotErr,
  pzrf.z as zPhotRF, pzrf.zErr as zPhotRFErr
from specPhotoall s, spa a, Photoobjall p,
  Photoz pz, PhotozRF pzrf
-- join all tables on the object identifiers
where s.SpecObjID=a.SpecObjID and
  s.ObjID=p.ObjID and pz.ObjID=p.ObjID and
  pzrf.ObjID=p.ObjID and
  s.zWarning=0 and s.z>0 and
  -- target selection flags
  (a.BOSS_TARGET1 & 2>0 or
    (a.BOSS_TARGET1 & 1>0 and s.Tile>=10324)) and
  a.BOSSPrimary=1 and a.zWarning_NoQSO=0 and
  a.Chunk!='BOSS1' and a.Chunk!='BOSS2' and
  -- photometric quality cuts and redshift range
  p.Fiber2Mag_i<21.5 and p.Clean=1 and
  (p.CalibStatus_<fil> & 1)!=0 and
  s.z>=0.45 and s.z<=0.9


^1 http://skyserver.sdss3.org/CasJobs/